EPITOPE SEQUENCES

Field of the Invention 

The present invention generally relates to peptides, and nucleic acids encoding peptides,
that are useful epitopes of target-associated antigens. More specifically, the invention relates to
epitopes that have a high affinity for MHC class I and that are produced by target-specific
proteasomes.

Description of the Related Art 

Neoplasia and the Immune System 

The neoplastic disease state commonly known as cancer is thought to result generally from
a single cell growing out of control. The uncontrolled growth state typically results from a
multi-step process in which a series of cellular systems fail, resulting in the genesis of a neoplastic cell.
The resulting neoplastic cell rapidly reproduces itself, forms one or more tumors, and eventually 
may cause the death of the host. 

Because the progenitor of the neoplastic cell shares the host's genetic material, neoplastic
cells are largely unassailed by the host's immune system. During immune surveillance, the process
in which the host's immune system surveys and localizes foreign materials, a neoplastic cell will
appear to the host's immune surveillance machinery as a "self" cell.

Viruses and the Immune System 

In contrast to cancer, virus infection involves the expression of clearly non-self
antigens. As a result, many virus infections are successfully dealt with by the immune system with
minimal clinical sequelae. Moreover, it has been possible to develop effective vaccines for many of
those infections that do cause serious disease. A variety of vaccine approaches have been used
successfully to combat various diseases. These approaches include subunit vaccines consisting of
individual proteins produced through recombinant DNA technology. Notwithstanding these
advances, the selection and effective administration of minimal epitopes for use as viral vaccines
has remained problematic.

In addition to the difficulties involved in epitope selection stands the problem of viruses
that have evolved the capability of evading a host's immune system. Many viruses, especially
viruses that establish persistent infections, such as members of the herpes and pox virus families,
produce immunomodulatory molecules that permit the virus to evade the host's immune system.
The effects of these immunomodulatory molecules on antigen presentation may be overcome by
the targeting of select epitopes for administration as immunogenic compositions. To better
understand the interaction of neoplastic cells and virally infected cells with the host's immune
system, a discussion of the system's components follows below.

The immune system functions to discriminate molecules endogenous to an organism
("self" molecules) from material exogenous or foreign to the organism ("non-self" molecules). The
immune system has two types of adaptive responses to foreign bodies based on the components that
mediate the response: a humoral response and a cell-mediated response. The humoral response is 
mediated by antibodies, while the cell-mediated response involves cells classified as lymphocytes. 
Recent anticancer and antiviral strategies have focused on mobilizing the host immune system as a 
means of anticancer or antiviral treatment or therapy. 

The immune system functions in three phases to protect the host from foreign bodies: the 
cognitive phase, the activation phase, and the effector phase. In the cognitive phase, the immune
system recognizes and signals the presence of a foreign antigen or invader in the body. The foreign 
antigen can be, for example, a cell surface marker from a neoplastic cell or a viral protein. Once the 
system is aware of an invading body, antigen specific cells of the immune system proliferate and 
differentiate in response to the invader-triggered signals. The last stage is the effector stage in which 
the effector cells of the immune system respond to and neutralize the detected invader. 

An array of effector cells implements an immune response to an invader. One type of 
effector cell, the B cell, generates antibodies targeted against foreign antigens encountered by the 
host. In combination with the complement system, antibodies direct the destruction of cells or
organisms bearing the targeted antigen. Another type of effector cell is the natural killer cell (NK 
cell), a type of lymphocyte having the capacity to spontaneously recognize and destroy a variety of 
virus infected cells as well as malignant cell types. The method used by NK cells to recognize target 
cells is poorly understood. 

Another type of effector cell, the T cell, has members classified into three subcategories, each
playing a different role in the immune response. Helper T cells secrete cytokines which stimulate the 
proliferation of other cells necessary for mounting an effective immune response, while suppressor T 
cells down-regulate the immune response. A third category of T cell, the cytotoxic T cell (CTL), is 
capable of directly lysing a targeted cell presenting a foreign antigen on its surface. 

The Major Histocompatibility Complex and T Cell Target Recognition 

T cells are antigen-specific immune cells that function in response to specific antigen signals. 
B lymphocytes and the antibodies they produce are also antigen-specific entities. However, unlike B 
lymphocytes, T cells do not respond to antigens in a free or soluble form. For a T cell to respond to 
an antigen, it requires the antigen to be processed to peptides which are then bound to a presenting 
structure encoded in the major histocompatibility complex (MHC). This requirement is called "MHC 
restriction" and it is the mechanism by which T cells differentiate "self" from "non-self" cells. If an
antigen is not displayed by a recognizable MHC molecule, the T cell will not recognize and act on the 
antigen signal. T cells specific for a peptide bound to a recognizable MHC molecule bind to these 
MHC-peptide complexes and proceed to the next stages of the immune response. 

There are two types of MHC, class I MHC and class II MHC. T helper cells (CD4+)
predominantly interact with class II MHC proteins while cytolytic T cells (CD8+) predominantly
interact with class I MHC proteins. Both classes of MHC protein are transmembrane proteins with a
majority of their structure on the external surface of the cell. Additionally, both classes of MHC
proteins have a peptide binding cleft on their external portions. It is in this cleft that small fragments
of proteins, endogenous or foreign, are bound and presented to the extracellular environment.

Cells called "professional antigen presenting cells" (pAPCs) display antigens to T cells using
the MHC proteins but additionally express various co-stimulatory molecules depending on the
particular state of differentiation/activation of the pAPC. When T cells, specific for the peptide bound
to a recognizable MHC protein, bind to these MHC-peptide complexes on pAPCs, the specific
co-stimulatory molecules that act upon the T cell direct the path of differentiation/activation taken by the
T cell. That is, the co-stimulatory molecules affect how the T cell will act on antigenic signals in
future encounters as it proceeds to the next stages of the immune response.

As discussed above, neoplastic cells are largely ignored by the immune system. A great deal 
of effort is now being expended in an attempt to harness a host's immune system to aid in combating 
the presence of neoplastic cells in a host. One such area of research involves the formulation of
anticancer vaccines.

Anticancer Vaccines 

Among the various weapons available to an oncologist in the battle against cancer is the 
immune system of the patient. Work has been done in various attempts to cause the immune 
system to combat cancer or neoplastic diseases. Unfortunately, the results to date have been 
largely disappointing. One area of particular interest involves the generation and use of anticancer
vaccines. 

To generate a vaccine or other immunogenic composition, it is necessary to introduce to a 
subject an antigen or epitope against which an immune response may be mounted. Although 
neoplastic cells are derived from and therefore are substantially identical to normal cells on a 
genetic level, many neoplastic cells are known to present tumor-associated antigens (TuAAs). In
theory, these antigens could be used by a subject's immune system to recognize and
attack the neoplastic cells. In reality, however, neoplastic cells generally appear to be ignored by
the host's immune system.

A number of different strategies have been developed in an attempt to generate vaccines
with activity against neoplastic cells. These strategies include the use of tumor-associated antigens
as immunogens. For example, U.S. Patent No. 5,993,828 describes a method for producing an
immune response against a particular subunit of the Urinary Tumor Associated Antigen by
administering to a subject an effective dose of a composition comprising inactivated tumor cells
having the Urinary Tumor Associated Antigen on the cell surface and at least one tumor associated
antigen selected from the group consisting of GM-2, GD-2, Fetal Antigen and Melanoma
Associated Antigen. Accordingly, this patent describes using whole, inactivated tumor cells as the
immunogen in an anticancer vaccine.

Another strategy used with anticancer vaccines involves administering a composition 
containing isolated tumor antigens. In one approach, MAGE-A1 antigenic peptides were used as 
an immunogen. (See Chaux, P., et al., "Identification of Five MAGE-A1 Epitopes Recognized by
Cytolytic T Lymphocytes Obtained by In Vitro Stimulation with Dendritic Cells Transduced with
MAGE-A1," J. Immunol., 163(5):2928-2936 (1999)). There have been several therapeutic trials
using MAGE-A1 peptides for vaccination, although the effectiveness of the vaccination regimes
was limited. The results of some of these trials are discussed in Vose, J.M., "Tumor Antigens
Recognized by T Lymphocytes," 10th European Cancer Conference, Day 2, Sept. 14, 1999.

In another example of tumor associated antigens used as vaccines, Scheinberg, et al. 
treated 12 chronic myelogenous leukemia (CML) patients already receiving interferon (IFN) or
hydroxyurea with 5 injections of class I-associated bcr-abl peptides with a helper peptide plus the
adjuvant QS-21. Scheinberg, D.A., et al., "BCR-ABL Breakpoint Derived Oncogene Fusion
Peptide Vaccines Generate Specific Immune Responses in Patients with Chronic Myelogenous
Leukemia (CML)," [Abstract 1665], American Society of Clinical Oncology 35th Annual Meeting,
Atlanta (1999). Proliferative and delayed type hypersensitivity (DTH) T cell responses indicative
of T-helper activity were elicited, but no cytolytic killer T cell activity was observed within the 
fresh blood samples. 

Additional examples of attempts to identify TuAAs for use as vaccines are seen in the
recent work of Cebon, et al. and Scheibenbogen, et al. Cebon, et al. immunized patients with
metastatic melanoma using intradermally administered MART-1 26-35 peptide with IL-12 in
increasing doses given either subcutaneously or intravenously. Of the first 15 patients, 1 complete
remission, 1 partial remission, and 1 mixed response were noted. Immune assays for T cell
generation included DTH, which was seen in patients with or without IL-12. Positive CTL assays
were seen in patients with evidence of clinical benefit, but not in patients without tumor regression.
Cebon, et al., "Phase I Studies of Immunization with Melan-A and IL-12 in HLA A2+ Positive
Patients with Stage III and IV Malignant Melanoma," [Abstract 1671], American Society of
Clinical Oncology 35th Annual Meeting, Atlanta (1999).

Scheibenbogen, et al. immunized 18 patients with 4 HLA class I restricted tyrosinase
peptides, 16 with metastatic melanoma and 2 adjuvant patients. Scheibenbogen, et al.,
"Vaccination with Tyrosinase peptides and GM-CSF in Metastatic Melanoma: a Phase II Trial,"
[Abstract 1680], American Society of Clinical Oncology 35th Annual Meeting, Atlanta (1999).
Increased CTL activity was observed in 4/15 patients, 2 adjuvant patients, and 2 patients with
evidence of tumor regression. As in the trial by Cebon, et al., patients with progressive disease did
not show boosted immunity. In spite of the various efforts expended to date to generate efficacious
anticancer vaccines, no such composition has yet been developed.

Antiviral Vaccines

Vaccine strategies to protect against viral diseases have had many successes. Perhaps the
most notable of these is the progress that has been made against the disease smallpox, which has
been driven to extinction. The success of the polio vaccine is of a similar magnitude.

Viral vaccines can be grouped into three classifications: live attenuated virus vaccines,
such as vaccinia for smallpox, the Sabin poliovirus vaccine, and measles, mumps, and rubella;
whole killed or inactivated virus vaccines, such as the Salk poliovirus vaccine, hepatitis A virus
vaccine and the typical influenza virus vaccines; and subunit vaccines, such as hepatitis B. Due to
their lack of a complete viral genome, subunit vaccines offer a greater degree of safety than those
based on whole viruses.

The paradigm of a successful subunit vaccine is the recombinant hepatitis B vaccine based
on the virus's envelope protein. Despite much academic interest in pushing the reductionist
subunit concept beyond single proteins to individual epitopes, the efforts have yet to bear much
fruit. Viral vaccine research has also concentrated on the induction of an antibody response,
although cellular responses also occur. However, many of the subunit formulations are particularly
poor at generating a CTL response.

Summary of the Invention 

Previous methods of priming professional antigen presenting cells (pAPCs) to display
target cell epitopes have relied simply on causing the pAPCs to express target-associated antigens
(TAAs), or epitopes of those antigens which are thought to have a high affinity for MHC I
molecules. However, the proteasomal processing of such antigens results in presentation of
epitopes on the pAPC that do not correspond to the epitopes present on the target cells.

Using the knowledge that an effective cellular immune response requires that pAPCs
present the same epitope that is presented by the target cells, the present invention provides
epitopes that have a high affinity for MHC I, and that correspond to the processing specificity of
the housekeeping proteasome, which is active in peripheral cells. These epitopes thus correspond
to those presented on target cells. The use of such epitopes in vaccines can activate the cellular
immune response to recognize the correctly processed TAA and can result in removal of target
cells that present such epitopes. In some embodiments, the housekeeping epitopes provided herein
can be used in combination with immune epitopes, generating a cellular immune response that is
competent to attack target cells both before and after interferon induction. In other embodiments
the epitopes are useful in the diagnosis and monitoring of the target-associated disease and in the
generation of immunological reagents for such purposes.

Embodiments of the invention relate to isolated epitopes and antigens or polypeptides that
comprise the epitopes. Preferred embodiments include an epitope or antigen having the sequence
as disclosed in Table 1. Other embodiments can include an epitope cluster comprising a
polypeptide from Table 1. Further, embodiments include a polypeptide having substantial similarity
to the already mentioned epitopes, antigens, or clusters. Other preferred embodiments include a
polypeptide having functional similarity to any of the above. Still further embodiments relate to a
nucleic acid encoding the polypeptide of any of the epitopes, clusters, antigens, and polypeptides
from Table 1 and mentioned herein.

The epitope can be immunologically active. The polypeptide comprising the epitope can 
be less than about 30 amino acids in length; more preferably, the polypeptide is 8 to 10 amino acids
in length, for example. Substantial or functional similarity can include addition of at least one 
amino acid, for example, and the at least one additional amino acid can be at an N-terminus of the 
polypeptide. The substantial or functional similarity can include a substitution of at least one 
amino acid. 

The epitope, cluster, or polypeptide comprising the same can have affinity to an HLA-A2 
molecule. The affinity can be determined by an assay of binding, by an assay of restriction of 
epitope recognition, by a prediction algorithm, and the like. The epitope, cluster, or polypeptide 
comprising the same can have affinity to an HLA-B7, HLA-B51 molecule, and the like. 
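As one illustration of affinity assessed by a prediction algorithm, the following minimal Python sketch scans a polypeptide sequence for windows of 8 to 10 amino acids that fit a simplified HLA-A2 anchor motif. The motif used here (L or M at position 2; V, L, or I at the C-terminus), the function names, and the example sequence are assumptions introduced for illustration only and are not taken from Table 1. Practical prediction algorithms typically score every position of the peptide rather than only the anchor residues.

```python
# Simplified anchor-residue screen for candidate HLA-A2 binders.
# ASSUMPTION: the anchor sets below are a coarse approximation chosen for
# illustration; they are not the prediction algorithm of the specification.
HLA_A2_P2_ANCHORS = set("LM")      # residues accepted at position 2
HLA_A2_CTERM_ANCHORS = set("VLI")  # residues accepted at the C-terminus


def candidate_windows(sequence, lengths=(8, 9, 10)):
    """Yield (start, peptide) for every window of the given lengths.

    start is the 1-based position of the first residue in the sequence.
    """
    for length in lengths:
        for i in range(len(sequence) - length + 1):
            yield i + 1, sequence[i:i + length]


def matches_a2_motif(peptide):
    """Return True if the peptide fits the simplified anchor motif."""
    return (
        len(peptide) >= 8
        and peptide[1] in HLA_A2_P2_ANCHORS
        and peptide[-1] in HLA_A2_CTERM_ANCHORS
    )


def predict_a2_candidates(sequence):
    """List every 8- to 10-residue window that fits the motif."""
    return [
        (start, peptide)
        for start, peptide in candidate_windows(sequence)
        if matches_a2_motif(peptide)
    ]


if __name__ == "__main__":
    # Hypothetical sequence, not a sequence from the specification.
    for start, peptide in predict_a2_candidates("ALAAAAAAVG"):
        print(start, peptide)
```

A motif screen of this kind only nominates candidates; affinity would still be confirmed by a binding assay or an assay of restriction of epitope recognition, as noted above.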

In preferred embodiments the polypeptide can be a housekeeping epitope. The epitope or 
polypeptide can correspond to an epitope displayed on a tumor cell, to an epitope displayed on a 
neovasculature cell, and the like. The epitope or polypeptide can be an immune epitope. The 
epitope, cluster and/or polypeptide can be a nucleic acid. 

Other embodiments relate to pharmaceutical compositions comprising the polypeptides, 
including an epitope from Table 1, a cluster, or a polypeptide comprising the same and a 
pharmaceutically acceptable adjuvant, carrier, diluent, excipient, and the like. The adjuvant can be
a polynucleotide. The polynucleotide can include a dinucleotide. The dinucleotide can be CpG, 
for example. The adjuvant can be encoded by a polynucleotide. The adjuvant can be a cytokine 
and the cytokine can be, for example, GM-CSF. 

The pharmaceutical compositions can further include a professional antigen-presenting cell 
(pAPC). The pAPC can be a dendritic cell, for example. The pharmaceutical composition can 
further include a second epitope. The second epitope can be a polypeptide. The second epitope 
can be a nucleic acid. The second epitope can be a housekeeping epitope, an immune epitope, and 
the like. 

Still further embodiments relate to pharmaceutical compositions that include any of the 
nucleic acids discussed herein, including those that encode polypeptides that comprise epitopes or 
antigens from Table 1. Such compositions can include a pharmaceutically acceptable adjuvant,
carrier, diluent, excipient, and the like. 

Other embodiments relate to recombinant constructs that include such a nucleic acid as 
described herein, including those that encode polypeptides that comprise epitopes or antigens from 
Table 1. The constructs can further include a plasmid, a viral vector, an artificial chromosome, and
the like. The construct can further include a sequence encoding at least one feature, such as, for
example, a second epitope, an IRES, an ISS, an NIS, or ubiquitin.

Further embodiments relate to purified antibodies that specifically bind to at least one of
the epitopes in Table 1A. Other embodiments relate to purified antibodies that specifically bind to
a peptide-MHC protein complex comprising an epitope disclosed in Table 1A or any other suitable
epitope. The antibody from any embodiment can be a monoclonal antibody.

Still other embodiments relate to multimeric MHC-peptide complexes that include an 
epitope, such as, for example, an epitope disclosed in Table 1.

Embodiments relate to isolated T cells expressing a T cell receptor specific for an
MHC-peptide complex. The complex can include an epitope, such as, for example, an epitope disclosed
in Table 1 of claim 1. The T cell can be produced by an in vitro immunization. The T cell can be
isolated from an immunized animal. Embodiments relate to T cell clones, including cloned T cells,
such as those discussed above. Embodiments also relate to polyclonal populations of T cells. Such
populations can include a T cell, as described above, for example.

Still further embodiments relate to pharmaceutical compositions that include a T cell, such
as those described above, for example, and a pharmaceutically acceptable adjuvant, carrier, diluent,
excipient, and the like.

Embodiments of the invention relate to isolated protein molecules comprising the binding
domain of a T cell receptor specific for an MHC-peptide complex. The complex can include an
epitope disclosed in Table 1. The protein can be multivalent. Other embodiments relate to isolated
nucleic acids encoding such proteins. Still further embodiments relate to recombinant constructs
that include such nucleic acids.

Other embodiments of the invention relate to host cells expressing the recombinant
construct described herein, including constructs encoding an epitope, cluster or polypeptide
comprising the same, disclosed in Table 1, for example. The host cell can be a dendritic cell,
macrophage, tumor cell, tumor-derived cell, and the like. The host cell can be a bacterium, fungus,
protozoan and the like. Embodiments also relate to pharmaceutical compositions that include a
host cell, such as those discussed herein, and a pharmaceutically acceptable adjuvant, carrier,
diluent, excipient, and the like.

Still other embodiments relate to vaccines or immunotherapeutic compositions that include
at least one component, such as, for example, an epitope disclosed in Table 1 or otherwise
described herein; a cluster that includes such an epitope; an antigen or polypeptide that includes
such an epitope; a composition described above and herein; a construct, a T cell, or a host cell as
described above and herein.

Further embodiments relate to methods of treating an animal. The methods can include
administering to an animal a vaccine or immunotherapeutic composition, including those disclosed
above and herein. The administering step can include a mode of delivery, such as, for example,
transdermal, intranodal, perinodal, oral, intravenous, intradermal, intramuscular, intraperitoneal,
mucosal, aerosol inhalation, instillation, and the like. The method can further include a step of
assaying to determine a characteristic indicative of a state of a target cell or target cells. The
method can include a first assaying step and a second assaying step, wherein the first assaying step
precedes the administering step, and wherein the second assaying step follows the administering
step. The method can further include a step of comparing the characteristic determined in the first
assaying step with the characteristic determined in the second assaying step to obtain a result. The
result can be, for example, evidence of an immune response, a diminution in number of target cells,
a loss of mass or size of a tumor comprising target cells, a decrease in number or concentration of
an intracellular parasite infecting target cells, and the like. 
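The comparison of a first and a second assaying step can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the function name, the returned fields, and the example values are hypothetical, and the sketch treats the characteristic as a single positive number (for example, a target cell count or a tumor volume) measured before and after administration.

```python
def compare_assays(before, after):
    """Compare one characteristic measured before and after administration.

    ASSUMPTION: 'before' and 'after' are numeric readouts of the same
    characteristic in the same units; 'before' must be positive so that a
    fractional change is defined.
    """
    if before <= 0:
        raise ValueError("the first assay value must be positive")
    change = after - before
    return {
        "change": change,                       # absolute difference
        "fractional_change": change / before,   # relative to the first assay
        "diminished": after < before,           # True if the value dropped
    }


if __name__ == "__main__":
    # Hypothetical readouts: 200 units before, 150 units after.
    print(compare_assays(200.0, 150.0))
```

Whether a given change counts as a meaningful result depends on the assay and its variability, which this sketch does not model.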

Embodiments relate to methods of evaluating immunogenicity of a vaccine or
immunotherapeutic composition. The methods can include administering to an animal a vaccine or
immunotherapeutic, such as those described above and elsewhere herein, and evaluating
immunogenicity based on a characteristic of the animal. The animal can be HLA-transgenic.

Other embodiments relate to methods of evaluating immunogenicity that include in vitro 
stimulation of a T cell with the vaccine or immunotherapeutic composition, such as those described
above and elsewhere herein, and evaluating immunogenicity based on a characteristic of the T cell. 
The stimulation can be a primary stimulation. 

Still further embodiments relate to methods of making a passive/adoptive
immunotherapeutic. The methods can include combining a T cell or a host cell, such as those
described above and elsewhere herein, with a pharmaceutically acceptable adjuvant, carrier, 
diluent, excipient, and the like. 

Other embodiments relate to methods of determining specific T cell frequency, and can
include the step of contacting T cells with an MHC-peptide complex comprising an epitope
disclosed in Table 1, or a complex comprising a cluster or antigen comprising such an epitope. The
contacting step can include at least one feature, such as, for example, immunization, restimulation,
detection, enumeration, and the like. The method can further include ELISPOT analysis, limiting
dilution analysis, flow cytometry, in situ hybridization, the polymerase chain reaction, any
combination thereof, and the like.

Embodiments relate to methods of evaluating immunologic response. The methods can
include the above-described methods of determining specific T cell frequency carried out prior to and
subsequent to an immunization step.

Another embodiment relates to methods of evaluating immunologic response. The
methods can include determining frequency, cytokine production, or cytolytic activity of T cells,
prior to and subsequent to a step of stimulation with MHC-peptide complexes comprising an
epitope, such as, for example, an epitope from Table 1, a cluster or a polypeptide comprising such
an epitope.

Further embodiments relate to methods of diagnosing a disease. The methods can include 
contacting a subject tissue with at least one component, including, for example, a T cell, a host cell,
an antibody, a protein, including those described above and elsewhere herein; and diagnosing the 
disease based on a characteristic of the tissue or of the component. The contacting step can take 
place in vivo. The contacting step can take place in vitro. 

Still other embodiments relate to methods of making a vaccine. The methods can include
combining at least one component, such as an epitope, a composition, a construct, a T cell, or a host cell,
including any of those described above and elsewhere herein, with a pharmaceutically acceptable
adjuvant, carrier, diluent, excipient, and the like.

Embodiments relate to computer readable media having recorded thereon the sequence of
any one of SEQ ID NOS: 1-602, in a machine having hardware or software that calculates the
physical, biochemical, immunologic, or molecular genetic properties of a molecule embodying said
sequence.

Still other embodiments relate to methods of treating an animal. The methods can include
combining administration to the animal of a vaccine or immunotherapeutic composition, such as
described above and elsewhere herein, with at least one other mode of treatment, including, for
example, radiation therapy, chemotherapy, biochemotherapy, surgery, and the like.

Further embodiments relate to isolated polypeptides that include an epitope cluster from a 
target-associated antigen having the sequence as disclosed in any one of Tables 25-44, wherein the 
amino acid sequence includes not more than about 80% of the amino acid sequence of the antigen. 
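The length limit on an epitope cluster can be checked with a short calculation. The following Python sketch is illustrative only: the function names and the example sequences are hypothetical, the cluster is assumed to be a contiguous segment of the antigen, and the word "about" in the 80% limit is approximated by a fixed threshold that a reader may adjust.

```python
def cluster_fraction(cluster, antigen):
    """Return the fraction of the antigen sequence covered by the cluster.

    ASSUMPTION: both arguments are one-letter amino acid strings and the
    cluster is a contiguous segment of the antigen.
    """
    if not antigen:
        raise ValueError("antigen sequence must not be empty")
    if cluster not in antigen:
        raise ValueError("cluster is not a contiguous segment of the antigen")
    return len(cluster) / len(antigen)


def within_cluster_limit(cluster, antigen, limit=0.80):
    """True if the cluster covers no more than `limit` of the antigen."""
    return cluster_fraction(cluster, antigen) <= limit


if __name__ == "__main__":
    antigen = "ACDEFGHIKL"  # hypothetical 10-residue antigen
    print(within_cluster_limit("CDEFGH", antigen))     # 6 of 10 residues
    print(within_cluster_limit("ACDEFGHIK", antigen))  # 9 of 10 residues
```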

Other embodiments relate to vaccines or immunotherapeutic products that include an
isolated peptide as described above and elsewhere herein. Still other embodiments relate to
isolated polynucleotides encoding a polypeptide as described above and elsewhere herein. Other
embodiments relate to vaccines or immunotherapeutic products that include these polynucleotides.
The polynucleotide can be DNA or RNA.

Brief Description of the Drawings

Figure 1 is a sequence alignment of NY-ESO-1 and several similar protein sequences. 
Figure 2 graphically represents a plasmid vaccine backbone useful for delivering nucleic 
acid-encoded epitopes. 

Figures 3A and 3B are FACS profiles showing results of HLA-A2 binding assays for
tyrosinase 207-215 and tyrosinase 208-216.

Figure 4 is a T=120 min. time point mass spectrum of the fragments produced by
proteasomal cleavage of SSX-2 31-68.

Figure 5 shows a binding curve for HLA-A2:SSX-2 41-49 with controls.

Figure 6 shows specific lysis of SSX-2 41-49-pulsed targets by CTL from
SSX-2 41-49-immunized HLA-A2 transgenic mice.

Figures 7A, 7B, and 7C show results of N-terminal pool sequencing of a T=60 min. time point
aliquot of the PSMA 163-192 proteasomal digest.

Figure 8 shows binding curves for HLA-A2:PSMA 168-177 and HLA-A2:PSMA 288-297 with
controls.

Figure 9 shows results of N-terminal pool sequencing of a T=60 min. time point aliquot of
the PSMA 281-310 proteasomal digest.

Figure 10 shows binding curves for HLA-A2:PSMA 461-469, HLA-A2:PSMA 460-469, and
HLA-A2:PSMA 663-671, with controls.

Figure 11 shows the results of a γ-IFN-based ELISPOT assay detecting PSMA 463-471-reactive
HLA-A1+ CD8+ T cells.

Figure 12 shows blocking of reactivity of the T cells used in figure 10 by anti-HLA-A1
mAb, demonstrating HLA-A1-restricted recognition.

Figure 13 shows a binding curve for HLA-A2:PSMA 663-671, with controls.

Figure 14 shows a binding curve for HLA-A2:PSMA 662-671, with controls.

Figure 15. Comparison of anti-peptide CTL responses following immunization with 
various doses of DNA by different routes of injection. 

Figure 16. Growth of transplanted gp33 expressing tumor in mice immunized by i.ln.
injection of gp33 epitope-expressing, or control, plasmid.

Figure 17. Amount of plasmid DNA detected by real-time PCR in injected or draining
lymph nodes at various times after i.ln. or i.m. injection, respectively.

Detailed Description of the Preferred Embodiment 

Definitions 

Unless otherwise clear from the context of the use of a term herein, the following listed
terms shall generally have the indicated meanings for purposes of this description.

PROFESSIONAL ANTIGEN-PRESENTING CELL (pAPC) - a cell that possesses T cell
costimulatory molecules and is able to induce a T cell response. Well characterized pAPCs include 
dendritic cells, B cells, and macrophages. 

PERIPHERAL CELL - a cell that is not a pAPC. 

HOUSEKEEPING PROTEASOME - a proteasome normally active in peripheral cells, and
generally not present or not strongly active in pAPCs.

IMMUNE PROTEASOME - a proteasome normally active in pAPCs; the immune 
proteasome is also active in some peripheral cells in infected tissues. 

EPITOPE - a molecule or substance capable of stimulating an immune response. In
preferred embodiments, epitopes according to this definition include but are not necessarily limited
to a polypeptide and a nucleic acid encoding a polypeptide, wherein the polypeptide is capable of
stimulating an immune response. In other preferred embodiments, epitopes according to this
definition include but are not necessarily limited to peptides presented on the surface of cells, the
peptides being non-covalently bound to the binding cleft of class I MHC, such that they can
interact with T cell receptors.

MHC EPITOPE - a polypeptide having a known or predicted binding affinity for a 
mammalian class I or class II major histocompatibility complex (MHC) molecule. 

HOUSEKEEPING EPITOPE - In a preferred embodiment, a housekeeping epitope is 
defined as a polypeptide fragment that is an MHC epitope, and that is displayed on a cell in which 
housekeeping proteasomes are predominantly active. In another preferred embodiment, a 
housekeeping epitope is defined as a polypeptide containing a housekeeping epitope according to 
the foregoing definition, that is flanked by one to several additional amino acids. In another 
preferred embodiment, a housekeeping epitope is defined as a nucleic acid that encodes a 
housekeeping epitope according to the foregoing definitions. 

IMMUNE EPITOPE - In a preferred embodiment, an immune epitope is defined as a 
polypeptide fragment that is an MHC epitope, and that is displayed on a cell in which immune 
proteasomes are predominantly active. In another preferred embodiment, an immune epitope is 
defined as a polypeptide containing an immune epitope according to the foregoing definition, that 
is flanked by one to several additional amino acids. In another preferred embodiment, an immune 
epitope is defined as a polypeptide including an epitope cluster sequence, having at least two 
polypeptide sequences having a known or predicted affinity for a class I MHC. In yet another 
preferred embodiment, an immune epitope is defined as a nucleic acid that encodes an immune 
epitope according to any of the foregoing definitions. 

TARGET CELL - a cell to be targeted by the vaccines and methods of the invention. 
Examples of target cells according to this definition include but are not necessarily limited to: a 
neoplastic cell and a cell harboring an intracellular parasite, such as, for example, a virus, a 
bacterium, or a protozoan. 

TARGET-ASSOCIATED ANTIGEN (TAA) - a protein or polypeptide present in a target 
cell. 

TUMOR-ASSOCIATED ANTIGEN (TuAA) - a TAA, wherein the target cell is a 
neoplastic cell. 

HLA EPITOPE - a polypeptide having a known or predicted binding affinity for a human 
class I or class II HLA complex molecule. 

ANTIBODY - a natural immunoglobulin (Ig), poly- or monoclonal, or any molecule 
composed in whole or in part of an Ig binding domain, whether derived biochemically or by use of 
recombinant DNA. Examples include, inter alia, F(ab), single chain Fv, and Ig variable 
region-phage coat protein fusions. 

ENCODE - an open-ended term such that a nucleic acid encoding a particular amino acid 
sequence can consist of codons specifying that polypeptide, but can also comprise additional 
sequences either translatable, or for the control of transcription, translation, or replication, or to 
facilitate manipulation of some host nucleic acid construct. 

SUBSTANTIAL SIMILARITY - this term is used to refer to sequences that differ from a 
reference sequence in an inconsequential way as judged by examination of the sequence. Nucleic 
acid sequences encoding the same amino acid sequence are substantially similar despite differences 
in degenerate positions or modest differences in length or composition of any non-coding regions. 
Amino acid sequences differing only by conservative substitution or minor length variations are 
substantially similar. Additionally, amino acid sequences comprising housekeeping epitopes that 
differ in the number of N-terminal flanking residues, or immune epitopes and epitope clusters that 
differ in the number of flanking residues at either terminus, are substantially similar. Nucleic acids 
that encode substantially similar amino acid sequences are themselves also substantially similar. 
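
By way of illustration only, the first criterion above (nucleic acids that differ at degenerate codon positions yet encode the same amino acid sequence) can be checked by translating both sequences and comparing the results. The following is a minimal sketch, assuming the standard genetic code; the function names and the two example coding sequences are hypothetical, written for this illustration to encode the SSX-2 41-49 peptide KASEKIFYV, and are not taken from any SEQ ID NO of this description.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code with amino acids listed in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding DNA sequence (length divisible by 3) into one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def encodes_same_peptide(dna_a, dna_b):
    """True when both coding sequences translate to the same amino acid sequence."""
    return translate(dna_a) == translate(dna_b)


# Two hypothetical coding sequences that differ at degenerate positions.
CDS_A = "".join(["AAA", "GCT", "TCT", "GAA", "AAA", "ATT", "TTT", "TAT", "GTT"])
CDS_B = "".join(["AAG", "GCC", "AGC", "GAG", "AAG", "ATC", "TTC", "TAC", "GTG"])

if __name__ == "__main__":
    print(translate(CDS_A), translate(CDS_B), encodes_same_peptide(CDS_A, CDS_B))
```

The sketch covers only the identical-translation case; conservative substitutions and flanking-residue differences, which the definition also treats as substantially similar, would need additional rules.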

FUNCTIONAL SIMILARITY - this term is used to refer to sequences that differ from a 
reference sequence in an inconsequential way as judged by examination of a biological or 
biochemical property, although the sequences may not be substantially similar. For example, two 
nucleic acids can be useful as hybridization probes for the same sequence but encode differing 
amino acid sequences. Two peptides that induce cross-reactive CTL responses are functionally 
similar even if they differ by non-conservative amino acid substitutions (and thus do not meet the 
substantial similarity definition). Pairs of antibodies, or TCRs, that recognize the same epitope can 
be functionally similar to each other despite whatever structural differences exist. In testing for 
functional similarity of immunogenicity one would generally immunize with the "altered" antigen 
and test the ability of the elicited response (Ab, CTL, cytokine production, etc.) to recognize the 
target antigen. Accordingly, two sequences may be designed to differ in certain respects while 
retaining the same function. Such designed sequence variants are among the embodiments of the 
present invention. 



Table 1A. SEQ ID NOS* including epitopes in Examples 1-7, 13. 



SEQ ID NO 


IDENTITY 


SEQUENCE 


1 


Tyr 207-216 


FLPWHRLFLL 


2 


Tyrosinase protein 


Accession number: P14679 


3 


SSX-2 protein 


Accession number: NP_003138 


4 


PSMA protein 


Accession number: NP_004467 


5 


Tyrosinase cDNA 


Accession number: NM_000372 


6 


SSX-2 cDNA 


Accession number: NM_003147 


7 


PSMA cDNA 


Accession number: NM_004476 


8 


Tyr 207-215 


FLPWHRLFL 


9 


Tyr 208-216 


LPWHRLFLL 


10 


SSX-2 31-68 


YFSKEEWEKMKASEKIFYVYMKRKYEAMTKLGFKATLP 


11 


SSX-2 32-40 


FSKEEWEKM 


12 


SSX-2 39-47 


KMKASEKIF 


13 


SSX-2 40-48 


MKASEKIFY 


14 


SSX-2 39-48 


KMKASEKIFY 


15 


SSX-2 41-49 


KASEKIFYV 


16 


SSX-2 40-49 


MKASEKIFYV 


17 


SSX-2 41-50 


KASEKIFYVY 


18 


SSX-2 42-49 


ASEKIFYVY 


19 


SSX-2 53-61 


RKYEAMTKL 


20 


SSX-2 52-61 


KRKYEAMTKL 


21 


SSX-2 54-63 


KYEAMTKLGF 


22 


SSX-2 55-63 


YEAMTKLGF 


23 


SSX-2 56-63 


EAMTKLGF 


24 


HBV18-27 


FLPSDYFPSV 


25 


HLA-B44 binder 


AEMGKYSFY 


26 


SSX-1 41-49 


KYSEKISYV 


27 


SSX-3 41-49 


KVSEKIVYV 


28 


SSX-4 41-49 


KSSEKIVYV 


29 


SSX-5 41-49 


KASEKIIYV 


30 


PSMA163-192 


AFSPQGMPEGDLVYVNYARTEDFFKLERDM 


31 


PSMA 168-190 


GMPEGDLVYVNYARTEDFFKLER 


32 


PSMA 169-177 


MPEGDLVYV 


33 


PSMA 168-177 


GMPEGDLVYV 


34 


PSMA 168-176 


GMPEGDLVY 


35 


PSMA 167-176 


QGMPEGDLVY 


36 


PSMA 169-176 


MPEGDLVY 


37 


PSMA 171-179 


EGDLVYVNY 


38 


PSMA 170-179 


PEGDLVYVNY 


39 


PSMA 174-183 


LVYVNYARTE 


40 


PSMA 177-185 


VNYARTEDF 


41 


PSMA 176-185 


YVNYARTEDF 


42 


PSMA 178-186 


NYARTEDFF 


43 


PSMA 179-186 


YARTEDFF 


44 


PSMA 181-189 


RTEDFFKLE 


45 


PSMA 281-310 


RGIAEAVGLPSIPVHPIGYYDAQKLLEKMG 


46 


PSMA 283-307 


IAEAVGLPSIPVHPIGYYDAQKLLE 


47 


PSMA 289-297 


LPSIPVHPI 


48 


PSMA 288-297 


GLPSIPVHPI 


49 


PSMA 297-305 


IGYYDAQKL 


50 


PSMA 296-305 


PIGYYDAQKL 


51 


PSMA 291-299 


SIPVHPIGY 


52 


PSMA 290-299 


PSIPVHPIGY 


53 


PSMA 292-299 


IPVHPIGY 


54 


PSMA 299-307 


YYDAQKLLE 




55 


PSMA 454-481 


SSIEGNYTLRVDCTPLMYSLVHNLTKEL 


56 


PSMA 456-464 


IEGNYTLRV 


57 


PSMA 455-464 


SIEGNYTLRV 


58 


PSMA 457-464 


EGNYTLRV 


59 


PSMA 461-469 


TLRVDCTPL 


60 


PSMA 460-469 


YTLRVDCTPL 


61 


PSMA 462-470 


LRVDCTPLM 


62 


PSMA 463-471 


RVDCTPLMY 


63 


PSMA 462-471 


LRVDCTPLMY 


64 


PSMA653-687 


FDKSNPIVLRMMNDQLMFLERAFIDPLGLPDRPFY 


65 


PSMA 660-681 


VLRMMNDQLMFLERAFIDPLGL 


66 


PSMA 663-671 


MMNDQLMFL 


67 


PSMA 662-671 


RMMNDQLMFL 


68 


PSMA 662-670 


RMMNDQLMF 


69 


Tyr 1-17 


MLLAVLYCLLWSFQTSA 



Table 1B. SEQ ID NOS* including epitopes in Examples 14 and 15. 



SEQ ID NO 


IDENTITY 


SEQUENCE 


70 


GP100 protein 


**Accession number: P40967 



71 



MAGE-1 protein 



Accession number: P43355 



72 



MAGE-2 protein 



Accession number: P43356 



73 



MAGE-3 protein 



Accession number: P43357 



74 



NY-ESO-1 protein 



Accession number: P78358 



75 



LAGE-1a protein 



Accession number: CAA11116 



76 


LAGE-1b protein 


Accession number: CAA11117 


77 


PRAME protein 


Accession number: NP_006106 


78 


PSA protein 


Accession number: P07288 


79 


PSCA protein 


Accession number: O43653 


80 


GP100 cds 


Accession number: U20093 


81 


MAGE-1 cds 


Accession number: M77481 


82 


MAGE-2 cds 


Accession number: L18920 


83 


MAGE-3 cds 


accession nuinuci » 


84 


NY-ESO-1 cDNA 


Accession number: U87459 


85 


PRAME cDNA 




86 


PSA cDNA 


Accession number: NM_001648 


87 


PSCA cDNA 


Accession number: AF043498 


88 


GP100 630-638 


LPHSSSHWL 


89 


GP100 629-638 


QLPHSSSHWL 


90 


GP100 614-622 


LIYRRRLMK 


91 


GP100 613-622 


SLIYRRRLMK 


92 


GP100 615-622 


IYRRRLMK 


93 


GP100 630-638 


LPHSSSHWL 


94 


GP100 629-638 


QLPHSSSHWL 


95 


MAGE-1 95-102 


ESLFRAVI 


96 


MAGE-1 93-102 


ILESLFRAVI 


97 


MAGE-1 93-101 


ILESLFRAV 


98 


MAGE-1 92-101 


CILESLFRAV 


99 


MAGE-1 92-100 


CILESLFRA 


100 


MAGE-1 263-271 


EFLWGPRAL 


101 


MAGE-1 264-271 


FLWGPRAL 


102 


MAGE-1 264-273 


FLWGPRALAE 


103 


MAGE-1 265-274 


LWGPRALAET 


104 


MAGE-1 268-276 


PRALAETSY 


105 


MAGE-1 267-276 


GPRALAETSY 


106 


MAGE-1 269-277 


RALAETSYV 


107 


MAGE-1 271-279 


LAETSYVKV 


108 


MAGE-1 270-279 


ALAETSYVKV 


109 


MAGE-1 272-280 


AETSYVKVL 


110 


MAGE-1 271-280 


LAETSYVKVL 


111 


MAGE-1 274-282 


TSYVKVLEY 


112 


MAGE-1 273-282 


ETSYVKVLEY 


113 


MAGE-1 278-286 


KVLEYVIKV 


114 


MAGE-1 168-177 


SYVLVTCLGL 


115 


MAGE-1 169-177 


YVLVTCLGL 


116 


MAGE-1 170-177 


VLVTCLGL 


117 


MAGE-1 240-248 


TQDLVQEKY 


118 


MAGE-1 239-248 


LTQDLVQEKY 


119 


MAGE-1 232-240 


YGEPRKLLT 


120 


MAGE-1 243-251 


LVQEKYLEY 


121 


MAGE-1 242-251 


DLVQEKYLEY 


122 


MAGE-1 230-238 


SAYGEPRKL 


123 


MAGE-1 278-286 


KVLEYVIKV 


124 


MAGE-1 277-286 


VKVLEYVIKV 


125 


MAGE-1 276-284 


YVKVLEYVI 


126 


MAGE-1 274-282 


TSYVKVLEY 


127 


MAGE-1 273-282 


ETSYVKVLEY 


128 


MAGE-1 283-291 


VIKVSARVR 


129 


MAGE-1 282-291 


YVIKVSARVR 


130 


MAGE-2 115-122 


ELVHFLLL 


131 


MAGE-2 113-122 


MVELVHFLLL 


132 


MAGE-2 109-116 


ISRKMVEL 


133 


MAGE-2 108-116 


AISRKMVEL 


134 


MAGE-2 107-116 


AAISRKMVEL 


135 


MAGE-2 112-120 


KMVELVHFL 


136 


MAGE-2 109-117 


ISRKMVELV 


137 


MAGE-2 108-117 


AISRKMVELV 


138 


MAGE-2 116-124 


LVHFLLLKY 


139 


MAGE-2 115-124 


ELVHFLLLKY 


140 


MAGE-2 111-119 


RKMVELVHF 


141 


MAGE-2 158-166 


LQLVFGIEV 


142 


MAGE-2 157-166 


YLQLVFGIEV 


143 


MAGE-2 159-167 


QLVFGIEVV 


144 


MAGE-2 158-167 


LQLVFGIEVV 


145 


MAGE-2 164-172 


IEVVEVVPI 


146 


MAGE-2 163-172 


GIEVVEVVPI 


147 


MAGE-2 162-170 


FGIEVVEVV 


148 


MAGE-2 154-162 


ASEYLQLVF 


149 


MAGE-2 153-162 


KASEYLQLVF 


150 


MAGE-2 218-225 


EEKIWEEL 


151 


MAGE-2 216-225 


APEEKIWEEL 


152 


MAGE-2 216-223 


APEEKIWE 


153 


MAGE-2 220-228 


KIWEELSML 


154 


MAGE-2 219-228 


EKIWEELSML 


155 


MAGE-2 271-278 


FLWGPRAL 


156 


MAGE-2 271-279 


FLWGPRALI 


157 


MAGE-2 278-286 


LIETSYVKV 


158 


MAGE-2 277-286 


ALIETSYVKV 


159 


MAGE-2 276-284 


RALIETSYV 


160 


MAGE-2 279-287 


LETSYVKVL 


161 


MAGE-2 278-287 


LIETSYVKVL 


162 


MAGE-3 271-278 


FLWGPRAL 


163 


MAGE-3 270-278 


EFLWGPRAL 


164 


MAGE-3 271-279 


FLWGPRALV 


165 


MAGE-3 276-284 


RALVETSYV 


166 


MAGE-3 272-280 


LWGPRALVE 


167 


MAGE-3 271-280 


FLWGPRALVE 


168 


MAGE-3 272-281 


LWGPRALVET 


169 


NY-ESO-1 82-90 


GPESRLLEF 


170 


NY-ESO-1 83-91 


PESRLLEFY 


171 


NY-ESO-1 82-91 


GPESRLLEFY 


172 


NY-ESO-1 84-92 


ESRLLEFYL 


173 


NY-ESO-1 86-94 


RLLEFYLAM 


174 


NY-ESO-1 88-96 


LEFYLAMPF 


175 


NY-ESO-1 87-96 


LLEFYLAMPF 1 


176 


NY-ESO-1 93-102 


AMPFATPMEA 


177 


NY-ESO-1 94-102 


MPFATPMEA 


178 


NY-ESO-1 115-123 


PLPVPGVLL 


179 


NY-ESO-1 114-123 


PPLPVPGVLL 


180 


NY-ESO-1 116-123 


LPVPGVLL 


181 


NY-ESO-1 103-112 


ELARRSLAQD 


182 


NY-ESO-1 118-126 


VPGVLLKEF 


183 


NY-ESO-1 117-126 


PVPGVLLKEF 


184 


NY-ESO-1 116-123 


LPVPGVLL 


185 


NY-ESO-1 127-135 


TVSGNILTI 


186 


NY-ESO-1 126-135 


FTVSGNILTI 


187 


NY-ESO-1 120-128 


GVLLKEFTV 


188 


NY-ESO-1 121-130 


VLLKEFTVSG 


189 


NY-ESO-1 122-130 


LLKEFTVSG 


190 


NY-ESO-1 118-126 


VPGVLLKEF 


191 


NY-ESO-1 117-126 


PVPGVLLKEF 


192 


NY-ESO-1 139-147 


AADHRQLQL 


193 


NY-ESO-1 148-156 


SISSCLQQL 


194 


NY-ESO-1 147-156 


LSISSCLQQL 


195 


NY-ESO-1 138-147 


TAADHRQLQL 


196 


NY-ESO-1 161-169 


WITQCFLPV 


197 


NY-ESO-1 157-165 


SLLMWITQC 


198 


NY-ESO-1 150-158 


SSCLQQLSL 


199 


NY-ESO-1 154-162 


QQLSLLMWI 


200 


NY-ESO-1 151-159 


SCLQQLSLL 


201 


NY-ESO-1 150-159 


SSCLQQLSLL 


202 


NY-ESO-1 163-171 


TQCFLPVFL 


203 


NY-ESO-1 162-171 


ITQCFLPVFL 


204 


PRAME 219-227 


PMQDIKMIL 


205 


PRAME 218-227 


MPMQDIKMIL 


206 



PRAME 428-436 


QHLIGLSNL 


207 


PRAME 427-436 


LQHLIGLSNL 


208 


PRAME 429-436 



HLIGLSNL 


209 


PRAME 431-439 


IGLSNLTHV 


210 


PRAME 430-439 


LIGLSNLTHV 


211 



PSA 53-61 ! 


VLVHPQWVL 


212 


PSA 52-61 


GVLVHPQWVL 


213 


PSA 52-60 


GVLVHPQWV 


214 


PSA 59-67 


WVLTAAHCI 


215 


PSA 54-63 


LVHPQWVLTA 


216 


PSA 53-62 


VLVHPQWVLT 


217 


PSA 54-62 


LVHPQWVLT 


218 


PSA 66-73 


CIRNKSVI 


219 


PSA 65-73 


HCIRNKSVI 


220 


PSA 56-64 


HPQWVLTAA 


221 


PSA 63-72 


AAHCIRNKSV 


222 


PSCA 116-123 


LLWGPGQL 


223 


PSCA 115-123 


LLLWGPGQL 


224 


PSCA 114-123 


GLLLWGPGQL 


225 


PSCA 99-107 


ALQPAAAIL 


226 


PSCA 98-107 


HALQPAAAIL 


227 


Tyr 128-137 


APEKDKFFAY 


228 


Tyr 129-137 


PEKDKFFAY 


229 


Tyr 130-138 


EKDKFFAYL 


230 


Tyr 131-138 


KDKFFAYL 


231 


Tyr 205-213 


PAFLPWHRL 


232 


Tyr 204-213 


APAFLPWHRL 


233 


Tyr 214-223 


FLLRWEQEIQ 


234 


Tyr 212-220 


RLFLLRWEQ 


235 


Tyr 191-200 


GSEIWRDIDF 


236 


Tyr 192-200 


SEIWRDIDF 


237 


Tyr 473-481 


RIWSWLLGA 


238 


Tyr 476-484 


SWLLGAAMV 


239 


Tyr 477-486 


WLLGAAMVGA 


240 


Tyr 478-486 


LLGAAMVGA 


241 


PSMA 4-12 


LLHETDSAV 


242 


PSMA 13-21 


ATARRPRWL 


243 


PSMA 53-61 


TPKHNMKAF 


244 


PSMA 64-73 


ELKAENIKKF 


245 


PSMA 69-77 


NIKKFLYNF 


246 


PSMA 68-77 


ENIKKFLYNF 


247 


PSMA 220-228 


AGAKGVILY 


248 


PSMA 468-477 


PLMYSLVHNL 


249 


PSMA 469-477 



LMYSLVHNL 


250 


PSMA 463-471 


RVDCTPLMY 


251 


PSMA 465-473 


DCTPLMYSL 


252 


PSMA 507-515 


SGMPRISKL 


253 


PSMA 506-515 


FSGMPRISKL 


254 


NY-ESO-1 136-163 


RLTAADHRQLQLSISSCLQQLSLLMWIT 


255 


NY-ESO-1 150-177 


SSCLQQLSLLMWITQCFLPVFLAQPPSG 



1 This H was reported as Y in the SWISSPROT database. 



Table 1C. SEQ ID NOS* including epitopes in Example 14. 



SEQ ID NO 


IDENTITY 


SEQUENCE 


256 


Mage-1 125-132 


KAEMLESV 


257 


Mage-1 124-132 


TKAEMLESV 


258 


Mage-1 123-132 


VTKAEMLESV 


259 


Mage-1 128-136 


MLESVIKNY 


260 


Mage-1 127-136 


EMLESVIKNY 


261 


Mage-1 125-133 


KAEMLESVI 


262 


Mage-1 146-153 


KASESLQL 


263 


Mage-1 145-153 


GKASESLQL 


264 


Mage-1 147-155 


ASESLQLVF 


265 


Mage-1 153-161 


LVFGIDVKE 


266 


Mage-1 114-121 


LLKYRARE 


267 


Mage-1 106-113 


VADLVGFL 


268 


Mage-1 105-113 


KVADLVGFL 


269 


Mage-1 107-115 


ADLVGFLLL 


270 


Mage-1 106-115 


VADLVGFLLL 


271 


Mage-1 114-123 


LLKYRAREPV 


272 


Mage-3 278-286 


LVETSYVKV 


273 


Mage-3 277-286 


ALVETSYVKV 


274 


Mage-3 285-293 


KVLHHMVKI 


275 


Mage-3 283-291 


YVKVLHHMV 


276 


Mage-3 275-283 


PRALVETSY 


277 


Mage-3 274-283 


GPRALVETSY 


278 


Mage-3 278-287 


LVETSYVKVL 


279 


ED-B 4'-5 


TIIPEVPQL 


280 


ED-B 5'-5 


DTIIPEVPQL 


281 


ED-B 1-10 


EVPQLTDLSF 


282 


ED-B 23-30 


TPLNSSTI 


283 


ED-B 18-25 


IGLRWTPL 


284 


ED-B 17-25 


SIGLRWTPL 


285 


ED-B 25-33 


LNSSTIIGY 


286 


ED-B 24-33 


PLNSSTIIGY 


287 


ED-B 23-31 


TPLNSSTII 


288 


ED-B 31-38 


IGYRITVV 


289 


ED-B 30-38 


IIGYRITVV 


290 


ED-B 29-38 


TIIGYRITVV 


291 


ED-B 31-39 


IGYRITVVA 


292 


ED-B 30-39 


IIGYRITVVA 


293 


CEA 184-191 


SLPVSPRL 


294 


CEA 183-191 


QSLPVSPRL 


295 


CEA 186-193 


PVSPRLQL 


296 


CEA 185-193 


LPVSPRLQL 


297 


CEA 184-193 


SLPVSPRLQL 


298 


CEA 185-192 


LPVSPRLQ 


299 


CEA 192-200 


QLSNGNRTL 


300 


CEA 191-200 


LQLSNGNRTL 


301 


CEA 179-187 


WVNNQSLPV 


302 


CEA 186-194 


PVSPRLQLS 


303 


CEA 362-369 


SLPVSPRL 


304 


CEA 361-369 


QSLPVSPRL 


305 


CEA 364-371 


PVSPRLQL 


306 


CEA 363-371 


LPVSPRLQL 


307 


CEA 362-371 


SLPVSPRLQL 


308 


CEA 363-370 


LPVSPRLQ 


309 


CEA 370-378 


QLSNDNRTL 


310 


CEA 369-378 


LQLSNDNRTL 


311 


CEA 357-365 


WVNNQSLPV 


312 


CEA 360-368 


NQSLPVSPR 


313 


CEA 540-547 


SLPVSPRL 


314 


CEA 539-547 


QSLPVSPRL 


315 


CEA 542-549 


PVSPRLQL 


316 


CEA 541-549 


LPVSPRLQL 


317 


CEA 540-549 


SLPVSPRLQL 


318 


CEA 541-548 


LPVSPRLQ 


319 


CEA 548-556 


QLSNGNRTL 


320 


CEA 547-556 


LQLSNGNRTL 


321 


CEA 535-543 


WVNGQSLPV 


322 


CEA 533-541 


LWWVNGQSL 


323 


CEA 532-541 


YLWWVNGQSL 


324 


CEA 538-546 


GQSLPVSPR 


325 


Her-2 30-37 


DMKLRLPA 


326 


Her-2 28-37 


GTDMKLRLPA 


327 


Her-2 42-49 


HLDMLRHL 


328 


Her-2 41-49 


THLDMLRHL 


329 


Her-2 40-49 


ETHLDMLRHL 


330 


Her-2 36-43 


PASPETHL 


331 


Her-2 35-43 


LPASPETHL 


332 


Her-2 34-43 


RLPASPETHL 


333 


Her-2 38-46 


SPETHLDML 


334 


Her-2 37-46 


ASPETHLDML 


335 


Her-2 42-50 


HLDMLRHLY 


336 


Her-2 41-50 


THLDMLRHLY 


337 


Her-2 719-726 


ELRKVKVL 


338 


Her-2 718-726 


TELRKVKVL 


339 


Her-2 717-726 


ETELRKVKVL 


340 


Her-2 715-723 


LKETELRKV 


341 


Her-2 714-723 


ILKETELRKV 


342 


Her-2 712-720 


MRILKETEL 


343 


Her-2 711-720 


QMRILKETEL 


344 


Her-2 717-725 


ETELRKVKV 


345 


Her-2 716-725 


KETELRKVKV 


346 


Her-2 706-714 


MPNQAQMRI 


347 


Her-2 705-714 


AMPNQAQMRI 


348 


Her-2 706-715 


MPNQAQMRIL 


349 


HER-2 966-973 


RPRFRELV 


350 



HER-2 965-973 


CRPRFRELV 


351 


HER-2 968-976 


RFRELVSEF 


352 


HER-2 967-976 


PRFRELVSEF 


353 


HER-2 964-972 


ECRPRFREL 


354 


NY-ESO-1 67-75 


GAASGLNGC 


355 


NY-ESO-1 52-60 


RASGPGGGA 


356 


NY-ESO-1 64-72 


PHGGAASGL 


357 


NY-ESO-1 63-72 


GPHGGAASGL 


358 


NY-ESO-1 60-69 


APRGPHGGAA 


359 


PRAME 112-119 


VRPRRWKL 


360 


PRAME 111-119 


EVRPRRWKL 


361 


PRAME 113-121 


RPRRWKLQV 


362 


PRAME 114-122 


PRRWKLQVL 


363 


PRAME 113-122 


RPRRWKLQVL 


364 


PRAME 116-124 


RWKLQVLDL 


365 


PRAME 115-124 


RRWKLQVLDL 


366 


PRAME 174-182 


PVEVLVDLF 


367 



PRAME 199-206 


VKRKKNVL 


368 


PRAME 198-206 


KVKRKKNVL 


369 


PRAME 197-206 


EKVKRKKNVL 


370 


PRAME 198-205 


KVKRKKNV 


371 



PRAME 201-208 


RKKNVLRL 


372 


PRAME 200-208 


KRKKNVLRL 



373 


PRAME 199-208 


VKRKKNVLRL 


374 


PRAME 189-196 


DELFSYLI 


375 


PRAME 205-213 


VLRLCCKKL 


376 


PRAME 204-213 


NVLRLCCKKL 


377 


PRAME 194-202 


YLIEKVKRK 


378 


PRAME 74-81 



QAWPFTCL 


379 


PRAME 73-81 


VQAWPFTCL 


380 


PRAME 72-81 


MVQAWPFTCL 


381 


PRAME 81-88 


LPLGVLMK 


382 


PRAME 80-88 


CLPLGVLMK 


383 


PRAME 79-88 


TCLPLGVLMK 


384 


PRAME 84-92 


GVLMKGQHL 


385 


PRAME 81-89 


LPLGVLMKG 


386 


PRAME 80-89 


CLPLGVLMKG 


387 


PRAME 76-85 


WPFTCLPLGV 


388 


PRAME 51-59 


ELFPPLFMA 


389 


PRAME 49-57 


PRELFPPLF 


390 


PRAME 48-57 


LPRELFPPLF 


391 


PRAME 50-58 


RELFPPLFM 


392 


PRAME 49-58 


PRELFPPLFM 


393 


PSA 239-246 


RPSLYTKV 


394 


PSA 238-246 


ERPSLYTKV 


395 


PSA 236-243 


LPERPSLY 


396 


PSA 235-243 


ALPERPSLY 


397 


PSA 241-249 


SLYTKVVHY 


398 


PSA 240-249 


PSLYTKVVHY 


399 


PSA 239-247 


RPSLYTKVV 


400 


PSMA 211-218 


GNKVKNAQ 


401 


PSMA 202-209 


IARYGKVF 


402 


PSMA 217-225 


AQLAGAKGV 


403 


PSMA 207-215 


KVFRGNKVK 


404 


PSMA 211-219 


GNKVKNAQL 


405 


PSMA 269-277 


TPGYPANEY 


406 


PSMA 268-277 


LTPGYPANEY 


407 


PSMA 271-279 


GYPANEYAY 


408 


PSMA 270-279 


PGYPANEYAY 


409 


PSMA 266-274 


DPLTPGYPA 


410 


PSMA 492-500 


SLYESWTKK 


411 


PSMA 491-500 


KSLYESWTKK 


412 


PSMA 486-494 


EGFEGKSLY 


413 


PSMA 485-494 


DEGFEGKSLY 


414 


PSMA 498-506 


TKKSPSPEF 


415 


PSMA 497-506 


WTKKSPSPEF 


416 


PSMA 492-501 


SLYESWTKKS 


417 


PSMA 725-732 


WGEVKRQI 


418 


PSMA 724-732 


AWGEVKRQI 


419 


PSMA 723-732 


KAWGEVKRQI 


420 


PSMA 723-730 


KAWGEVKR 


421 


PSMA 722-730 


SKAWGEVKR 


422 


PSMA 731-739 


QIYVAAFTV 


423 


PSMA 733-741 


YVAAFTVQA 


424 


PSMA 725-733 


WGEVKRQIY 


425 


PSMA 727-735 


EVKRQIYVA 


426 


PSMA 738-746 


TVQAAAETL 


427 


PSMA 737-746 


FTVQAAAETL 


428 


PSMA 729-737 


KRQIYVAAF 


429 


PSMA 721-729 


PSKAWGEVK 


430 


PSMA 723-731 


KAWGEVKRQ 


431 


PSMA 100-108 


WKEFGLDSV 


432 


PSMA 99-108 


QWKEFGLDSV 


433 


PSMA 102-111 


EFGLDSVELA 


434 


SCP-1 126-134 


ELRQKESKL 


435 


SCP-1 125-134 


AELRQKESKL 


436 


SCP-1 133-141 


KLQENRKII 


437 


SCP-1 298-305 


QLEEKTKL 


438 


SCP-1 297-305 


NQLEEKTKL 


439 


SCP-1 288-296 


LLEESRDKV 


440 


SCP-1 287-296 


FLLEESRDKV 


441 


SCP-1 291-299 


ESRDKVNQL 


442 


SCP-1 290-299 


EESRDKVNQL 


443 


SCP-1 475-483 


EKEVHDLEY 


444 


SCP-1 474-483 


REKEVHDLEY 


445 


SCP-1 480-488 


DLEYSYCHY 


446 


SCP-1 477-485 


EVHDLEYSY 


447 


SCP-1 477-486 


EVHDLEYSYC 


448 


SCP-1 502-509 


KLSSKREL 


449 


SCP-1 508-515 


ELKNTEYF 


450 


SCP-1 507-515 


RELKNTEYF 


451 


SCP-1 496-503 


KRGQRPKL 


452 


SCP-1 494-503 


LPKRGQRPKL 


453 


SCP-1 509-517 


LKNTEYFTL 


454 


SCP-1 508-517 


ELKNTEYFTL 


455 


SCP-1 506-514 


KRELKNTEY 


456 


SCP-1 502-510 


KLSSKRELK 


457 


SCP-1 498-506 


GQRPKLSSK 


458 


SCP-1 497-506 


RGQRPKLSSK 


459 


SCP-1 500-508 


RPKLSSKRE 


460 


SCP-1 573-580 


LEYVREEL 


461 


SCP-1 572-580 


ELEYVREEL 


462 


SCP-1 571-580 


NELEYVREEL 


463 


SCP-1 579-587 


ELKQKREDEV 


464 


SCP-1 575-583 


YVREELKQK 


465 


SCP-1 632-640 


QLNVYEIKV 


466 


SCP-1 630-638 


SKQLNVYEI 


467 


SCP-1 628-636 


AESKQLNVY 


468 


SCP-1 627-636 


TAESKQLNVY 


469 


SCP-1 638-645 


IKVNKLEL 


470 


SCP-1 637-645 


EIKVNKLEL 


471 


SCP-1 636-645 


YEIKVNKLEL 


472 


SCP-1 642-650 


KLELELESA 


473 


SCP-1 635-643 


VYEIKVNKL 


474 


SCP-1 634-643 


NVYEIKVNKL 


475 


SCP-1 646-654 


ELESAKQKF 


476 


SCP-1 642-650 


KLELELESA 


477 


SCP-1 646-654 


ELESAKQKF 


478 


SCP-1 771-778 


KEKLKREA 


479 


SCP-1 777-785 


EAKENTATL 


480 


SCP-1 776-785 


REAKENTATL 


481 


SCP-1 773-782 


KLKREAKENT 


482 


SCP-1 112-119 


EAEKIKKW 


483 


SCP-1 101-109 


GLSRVYSKL 


484 


SCP-1 100-109 


EGLSRVYSKL 


485 


SCP-1 108-116 


KLYKEAEKI 


486 


SCP-1 98-106 


NSEGLSRVY 


487 


SCP-1 97-106 


ENSEGLSRVY 


488 


SCP-1 102-110 


LSRVYSKLY 


489 


SCP-1 101-110 


GLSRVYSKLY 


490 


SCP-1 96-105 


LENSEGLSRV 


491 


SCP-1 108-117 


KLYKEAEKIK 


492 


SCP-1 949-956 


REDRWAVI 


493 


SCP-1 948-956 


MREDRWAVI 


494 


SCP-1 947-956 


KMREDRWAVI 


495 


SCP-1 947-955 


KMREDRWAV 


496 


SCP-1 934-942 


TTPGSTLKF 


497 


SCP-1 933-942 


LTTPGSTLKF 


498 


SCP-1 937-945 


GSTLKFGAI 


499 


SCP-1 945-953 


IRKMREDRW 


500 


SCP-1 236-243 


RLEMHFKL 


501 


SCP-1 235-243 


SRLEMHFKL 


502 


SCP-1 242-250 


KLKEDYEKI 


503 


SCP-1 249-257 


KIQHLEQEY 


504 


SCP-1 248-257 


EKIQHLEQEY 


505 


SCP-1 233-242 


ENSRLEMHF 


506 


SCP-1 236-245 


RLEMHFKLKE 


507 


SCP-1 324-331 


LEDIKVSL 


508 


SCP-1 323-331 


ELEDIKVSL 


509 


SCP-1 322-331 


KELEDIKVSL 


510 


SCP-1 320-327 


LTKELEDI 


511 


SCP-1 319-327 


HLTKELEDI 


512 


SCP-1 330-338 


SLQRSVSTQ 


513 


SCP-1 321-329 


TKELEDIKV 


514 


SCP-1 320-329 


LTKELEDIKV 


515 


SCP-1 326-335 


DIKVSLQRSV 


516 


SCP-1 281-288 


KMKDLTFL 


517 


SCP-1 280-288 


NKMKDLTFL 


518 


SCP-1 279-288 


ENKMKDLTFL 


519 


SCP-1 288-296 


LLEESRDKV 


520 


SCP-1 287-296 


FLLEESRDKV 


521 


SCP-1 291-299 


ESRDKVNQL 


522 


SCP-1 290-299 


EESRDKVNQL 


523 


SCP-1 277-285 


EKENKMKDL 


524 


SCP-1 276-285 


TEKENKMKDL 


525 


SCP-1 279-287 


ENKMKDLTF 


526 


SCP-1 218-225 


IEKMITAF 


527 


SCP-1 217-225 


NIEKMITAF 


528 


SCP-1 216-225 


SNIEKMITAF 


529 


SCP-1 223-230 


TAFEELRV 


530 


SCP-1 222-230 


ITAFEELRV 


531 


SCP-1 221-230 


MITAFEELRV 


532 


SCP-1 220-228 


KMITAFEEL 


533 


SCP-1 219-228 


EKMITAFEEL 


534 


SCP-1 227-235 


ELRVQAENS 


535 


SCP-1 213-222 


DLNSNIEKMI 


536 


SCP-1 837-844 


WTSAKNTL 


537 


SCP-1 846-854 


TPLPKAYTV 


538 


SCP-1 845-854 


STPLPKAYTV 


539 


SCP-1 844-852 


LSTPLPKAY 


540 


SCP-1 843-852 


TLSTPLPKAY 


541 


SCP-1 842-850 


NTLSTPLPK 


542 


SCP-1 841-850 


KNTLSTPLPK 


543 


SCP-1 828-835 


ISKDKRDY 


544 


SCP-1 826-835 


HGISKDKRDY 


545 


SCP-1 832-840 


KRDYLWTSA 


546 


SCP-1 829-838 


SKDKRDYLWT 


547 


SCP-1 279-286 


ENKMKDLT 


548 


SCP-1 260-268 


EINDKEKQV 


549 


SCP-1 274-282 


QTCEKENKM 


550 


SCP-1 269-277 


SLLLIQITE 


551 


SCP-1 453-460 


FEKIAEEL 


552 


SCP-1 452-460 


QFEKIAEEL 


553 


SCP-1 451-460 


KQFEKIAEEL 


554 


SCP-1 449-456 


DNKQFEKI 


555 


SCP-1 448-456 


YDNKQFEKI 


556 


SCP-1 447-456 


LYDNKQFEKI 


557 


SCP-1 440-447 


LGEKETLL 


558 


SCP-1 439-447 


VLGEKETLL 


559 


SCP-1 438-447 


KVLGEKETLL 


560 


SCP-1 390-398 


LLRTEQQRL 


561 


SCP-1 389-398 


ELLRTEQQRL 


562 


SCP-1 393-401 


TEQQRLENY 


563 


SCP-1 392-401 


RTEQQRLENY 


564 


SCP-1 402-410 


EDQLIILTM 


565 


SCP-1 397-406 


RLENYEDQLI 


566 


SCP-1 368-375 


KARAAHSF 


567 


SCP-1 376-384 


VVTEFETTV 


568 


SCP-1 375-384 


FVVTEFETTV 


569 


SCP-1 377-385 


VTEFETTVC 


570 


SCP-1 376-385 


VVTEFETTVC 


571 


SCP-1 344-352 


DLQIATNTI 


572 


SCP-1 347-355 


IATNTICQL 


573 


SCP-1 346-355 


QIATNTICQL 


574 


SSX4 57-65 


VMTKLGFKY 


575 


SSX4 53-61 


LNYEVMTKL 


576 


SSX4 52-61 


KLNYEVMTKL 


577 


SSX4 66-74 


TLPPFMRSK 


578 


SSX4 110-118 


KIMPKKPAE 


579 


SSX4 103-112 


SLQRIFPKIM 


580 


Tyr 463-471 


YIKSYLEQA 


581 


Tyr 459-467 


SFQDYIKSY 


582 


Tyr 458-467 


DSFQDYIKSY 


583 


Tyr 507-514 


LPEEKQPL 


584 


Tyr 506-514 


QLPEEKQPL 


585 


Tyr 505-514 


KQLPEEKQPL 


586 


Tyr 507-515 


LPEEKQPLL 


587 


Tyr 506-515 


QLPEEKQPLL 


588 


Tyr 497-505 


SLLCRHKRK 


589 


ED-B domain of 
Fibronectin 


EVPQLTDLSFVDITDSSIGLRWTPLNSSTIIGYRI 
TVVAAGEGIPIFEDFVDSSVGYYTVTGLEPGID 
YDISVITLINGGESAPTTLTQQT 


590 


ED-B domain of 
Fibronectin with 
flanking sequence 
from Fibronectin 


CTFDNLSPGLEYNVSVYTVKDDKESVPISDTIIP 
EVPQLTDLSFVDITDSSIGLRWTPLNSSTIIGYRI 
TVVAAGEGIPIFEDFVDSSVGYYTVTGLEPGID 
YDISVITLINGGESAPTTLTQQT 
AVPPPTDLRFTNIGPDTMRVTW 


591 


ED-B domain of 
Fibronectin cds 


Accession number: X07717 


592 


CEA protein 


Accession number: P06731 


593 


CEA cDNA 


Accession number: NM_004363 


594 


Her2/Neu protein 


Accession number: P04626 


595 


Her2/Neu cDNA 


Accession number: M11730 


596 


SCP-1 protein 


Accession number: Q15431 


597 


SCP-1 cDNA 


Accession number: X95654 


598 


SSX-4 protein 


Accession number: O60224 


599 


SSX-4 cDNA 


Accession number: NM_005636 



*Any of SEQ ID NOS. 1, 8, 9, 11-23, 26-29, 32-44, 47-54, 56-63, 66-68, 88-253, and 256-588 can 
be useful as epitopes in any of the various embodiments of the invention. Any of SEQ ID NOS. 10, 
30, 31, 45, 46, 55, 64, 65, 69, 254, and 255 can be useful as sequences containing epitopes or 
epitope clusters, as described in various embodiments of the invention. 

**All accession numbers used here and throughout can be accessed through the NCBI databases, 
for example, through the Entrez search and retrieval system on the world wide web. 
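
Because each IDENTITY entry in Tables 1A-1C names a residue range and each SEQUENCE entry lists the corresponding residues, the two can be cross-checked: a range of start-end should pair with a sequence of end - start + 1 residues. The following is a minimal sketch of such a check; the function names are hypothetical, and the parser assumes the plain "NAME start-end" form (entries using flanking-residue notation such as ED-B 4'-5, or naming a whole protein, are simply reported as having no range).

```python
import re

# Matches a trailing "start-end" residue range, e.g. the "41-49" in "SSX-2 41-49".
RANGE_AT_END = re.compile(r"(\d+)\s*-\s*(\d+)\s*$")


def span_length(identity):
    """Return the number of residues implied by an identity label, or None if no range is found."""
    match = RANGE_AT_END.search(identity)
    if match is None:
        return None
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


def row_is_consistent(identity, sequence):
    """True when the sequence length equals the length implied by the identity's residue range."""
    expected = span_length(identity)
    return expected is not None and expected == len(sequence)


if __name__ == "__main__":
    rows = [
        ("SSX-2 41-49", "KASEKIFYV"),   # SEQ ID NO: 15 as listed
        ("SSX-2 42-49", "ASEKIFYVY"),   # SEQ ID NO: 18 as listed: range implies 8 residues, 9 given
    ]
    for identity, sequence in rows:
        print(identity, sequence, row_is_consistent(identity, sequence))
```

A mismatch flags a row for review but does not by itself indicate whether the range or the sequence is the entry in error.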

Note that the following discussion sets forth the inventors' understanding of the operation 
of the invention. However, it is not intended that this discussion limit the patent to any particular 
theory of operation not set forth in the claims. 

In pursuing the development of epitope vaccines, others have generated lists of predicted 
epitopes based on MHC binding motifs. Such peptides can be immunogenic, but may not 
correspond to any naturally produced antigenic fragment, so that whole antigen will not elicit a 
similar response or sensitize a target cell to cytolysis by CTL. Therefore such lists do not 
differentiate between those sequences that can be useful as vaccines and those that cannot. Efforts 
to determine which of these predicted epitopes are in fact naturally produced have often relied on 
screening their reactivity with tumor infiltrating lymphocytes (TIL). However, TIL are strongly 
biased to recognize immune epitopes whereas tumors (and chronically infected cells) will generally 
present housekeeping epitopes. Thus, unless the epitope is produced by both the housekeeping and 
immune proteasomes, the target cell will generally not be recognized by CTL induced with 
TIL-identified epitopes. The epitopes of the present invention, in contrast, are generated by the action 
of a specified proteasome, indicating that they can be naturally produced, and enabling their 
appropriate use. The importance of the distinction between housekeeping and immune epitopes to 
vaccine design is more fully set forth in PCT publication WO 01/82963 A2. 

The epitopes of the invention include or encode polypeptide fragments of TAAs that are 
precursors or products of proteasomal cleavage by a housekeeping or immune proteasome, and that 
have known or predicted affinity for at least one allele of MHC I. In some embodiments, the 
epitopes include or encode a polypeptide of about 6 to 25 amino acids in length, preferably about 7 
to 20 amino acids in length, more preferably about 8 to 15 amino acids in length, and still more 
preferably 9 or 10 amino acids in length. However, it is understood that the polypeptides can be 
larger as long as they do not contain sequences that cause the polypeptides to be directed away 
from the proteasome or to be destroyed by the proteasome. For immune epitopes, if the larger 
peptides do not contain such sequences, they can be processed in the pAPC by the immune 
proteasome. Housekeeping epitopes may also be embedded in longer sequences provided that the 
sequence is adapted to facilitate liberation of the epitope's C-terminus by action of the 
immunoproteasome. The sequences of these epitopes can be subjected to computer analysis in 
order to calculate physical, biochemical, immunologic, or molecular genetic properties such as 
mass, isoelectric point, predicted mobility in electrophoresis, predicted binding to other MHC 
molecules, melting temperature of nucleic acid probes, reverse translations, similarity or homology 
to other sequences, and the like. 
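
As one illustration of the kind of computer analysis mentioned above, the following minimal Python sketch computes the monoisotopic mass of an unmodified linear peptide. The residue masses are standard reference values and are not taken from this disclosure; the function name is hypothetical. The example peptides are SEQ ID NOS. 584 and 586 as listed above.

```python
# Monoisotopic residue masses (Da) of the 20 standard amino acids.
# These are standard reference values, not data from this disclosure.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # one water is added for the free N- and C-termini


def monoisotopic_mass(peptide: str) -> float:
    """Return the monoisotopic mass of an unmodified linear peptide."""
    return sum(RESIDUE_MASS[aa] for aa in peptide) + WATER


for pep in ("QLPEEKQPL", "LPEEKQPLL"):
    print(pep, len(pep), round(monoisotopic_mass(pep), 4))
```

Other listed properties, such as isoelectric point or predicted MHC binding, would need additional parameter tables or prediction models that are outside the scope of this sketch.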

In constructing the polynucleotides encoding the polypeptide epitopes of the invention, the 
gene sequence of the associated TAA can be used, or the polynucleotide can be assembled from 
any of the corresponding codons. For a 10 amino acid epitope this can constitute on the order of 
10^6 different sequences, depending on the particular amino acid composition. While large, this is a 
distinct and readily definable set representing a minuscule fraction of the >10^18 possible 
polynucleotides of this length, and thus in some embodiments, equivalents of a particular sequence 
disclosed herein encompass such distinct and readily definable variations on the listed sequence. 
In choosing a particular one of these sequences to use in a vaccine, considerations such as codon 
usage, self-complementarity, restriction sites, chemical stability, etc. can be used as will be 
apparent to one skilled in the art. 
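
The size of the set of synonymous coding sequences can be checked with a short calculation. The sketch below, which assumes the standard genetic code and uses a hypothetical function name, multiplies the number of codons available for each residue of a peptide and compares the result with the number of all polynucleotides of the same length (4^30 for a 10 amino acid epitope). The example peptide is SEQ ID NO. 587.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code
# (an assumption of this sketch; other translation tables differ).
CODONS_PER_AMINO_ACID = {
    "L": 6, "S": 6, "R": 6,
    "A": 4, "G": 4, "P": 4, "T": 4, "V": 4,
    "I": 3,
    "C": 2, "D": 2, "E": 2, "F": 2, "H": 2, "K": 2, "N": 2, "Q": 2, "Y": 2,
    "M": 1, "W": 1,
}


def coding_sequence_count(peptide: str) -> int:
    """Count the distinct polynucleotides that encode the given peptide."""
    return prod(CODONS_PER_AMINO_ACID[aa] for aa in peptide)


peptide = "QLPEEKQPLL"
n_coding = coding_sequence_count(peptide)
n_total = 4 ** (3 * len(peptide))  # all polynucleotides of the same length
print(f"{peptide}: {n_coding} coding sequences out of {n_total:.2e} possible")
```

For this particular 10-mer the product is 110,592, somewhat below 10^6; peptides richer in leucine, serine, or arginine give larger counts, which is why the figure depends on amino acid composition.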

The invention contemplates producing peptide epitopes. Specifically these epitopes are 
derived from the sequence of a TAA, and have known or predicted affinity for at least one allele of 
MHC I. Such epitopes are typically identical to those produced on target cells or pAPCs. 

Compositions Containing Active Epitopes 

Embodiments of the present invention provide polypeptide compositions, including 
vaccines, therapeutics, diagnostics, pharmacological and pharmaceutical compositions. The 
various compositions include newly identified epitopes of TAAs, as well as variants of these 
epitopes. Other embodiments of the invention provide polynucleotides encoding the polypeptide 
epitopes of the invention. The invention further provides vectors for expression of the polypeptide 
epitopes for purification. In addition, the invention provides vectors for the expression of the 
polypeptide epitopes in an APC for use as an anti-tumor vaccine. Any of the epitopes or antigens, 
or nucleic acids encoding the same, from Table 1A can be used. Other embodiments relate to 
methods of making and using the various compositions. 

A general architecture for a class I MHC-binding epitope can be described, and has been 
reviewed more extensively in Madden, D.R. Annu. Rev. Immunol. 13:587-622, 1995. Much of the 
binding energy arises from main chain contacts between conserved residues in the MHC molecule 
and the N- and C-termini of the peptide. Additional main chain contacts are made but vary among 
MHC alleles. Sequence specificity is conferred by side chain contacts of so-called anchor residues 
with pockets that, again, vary among MHC alleles. Anchor residues can be divided into primary 
and secondary. Primary anchor positions exhibit strong preferences for relatively well-defined sets 
of amino acid residues. Secondary positions show weaker and/or less well-defined preferences that 
can often be better described in terms of less favored, rather than more favored, residues. 
Additionally, residues in some secondary anchor positions are not always positioned to contact the 
pocket on the MHC molecule at all. Thus, a subset of peptides exists that bind to a particular MHC 
molecule and have a side chain-pocket contact at the position in question and another subset exists 
that show binding to the same MHC molecule that does not depend on the conformation the 
peptide assumes in the peptide-binding groove of the MHC molecule. The C-terminal residue (PΩ) 
is preferably a primary anchor residue. For many of the better studied HLA molecules (e.g., A2, 
A68, B27, B7, B35, and B53) the second position (P2) is also an anchor residue. However, central 
anchor residues have also been observed including P3 and P5 in HLA-B8, as well as P5 and PΩ-3 
in the murine MHC molecules H-2Db and H-2Kb, respectively. Since more stable binding will 
generally improve immunogenicity, anchor residues are preferably conserved or optimized in the 
design of variants, regardless of their position. 
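
The position-based description above lends itself to a simple check. The following sketch reads the residues at P2 and at the C-terminal position of a peptide and tests them against a motif. The motif shown for HLA-A2 (L or M at P2, V or L at the C-terminus) is a commonly cited preference used here only as an illustrative assumption; it is not taken from this disclosure, and the helper names are hypothetical.

```python
def residue_at(peptide: str, position: int) -> str:
    """Return the residue at a 1-based position.

    Positive values count from the N-terminus (2 is P2). Negative values
    count from the C-terminus (-1 is the C-terminal residue, -2 the one
    before it).
    """
    if position == 0:
        raise ValueError("positions are 1-based; use -1 for the C-terminus")
    return peptide[position - 1] if position > 0 else peptide[position]


# Illustrative assumption only: a simplified primary-anchor motif for HLA-A2.
ASSUMED_A2_MOTIF = {2: set("LM"), -1: set("VL")}


def matches_motif(peptide: str, motif: dict) -> bool:
    """True if every anchor position carries one of the allowed residues."""
    return all(residue_at(peptide, pos) in allowed
               for pos, allowed in motif.items())


for pep in ("QLPEEKQPL", "LPEEKQPLL"):
    print(pep, residue_at(pep, 2), residue_at(pep, -1),
          matches_motif(pep, ASSUMED_A2_MOTIF))
```

A motif match of this kind only flags candidate anchors; it says nothing about secondary anchors or about whether the peptide is actually produced by a proteasome.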

Because the anchor residues are generally located near the ends of the epitope, the peptide 
can buckle upward out of the peptide-binding groove allowing some variation in length. Epitopes 
ranging from 8-11 amino acids have been found for HLA-A68, and up to 13 amino acids for 
HLA-A2. In addition to length variation between the anchor positions, single residue truncations 
and extensions have been reported at the N- and C-termini, respectively. Of the non-anchor 
residues, some point up out of the groove, making no contact with the MHC molecule but being 
available to contact the TCR, very often P1, P4, and PΩ-1 for HLA-A2. Others of the non-anchor 
residues can become interposed between the upper edges of the peptide-binding groove and the 
TCR, contacting both. The exact positioning of these side chain residues, and thus their effects on 
binding, MHC fine conformation, and ultimately immunogenicity, are highly sequence dependent. 
For an epitope to be highly immunogenic it must not only promote stable enough TCR binding for 
activation to occur, but the TCR must also have a high enough off-rate that multiple TCR 
molecules can interact sequentially with the same peptide-MHC complex (Kalergis, A.M. et al., 
Nature Immunol. 2:229-234, 2001). Thus without further information about the ternary complex, 
both conservative and non-conservative substitutions at these positions merit consideration when 
designing variants. 

The polypeptide epitope variants can be made, for example, using any of the techniques 
and guidelines for conservative and non-conservative mutations. Variants can be derived from 
substitution, deletion or insertion of one or more amino acids as compared with the native 
sequence. Amino acid substitutions can be the result of replacing one amino acid with another 
amino acid having similar structural and/or chemical properties, such as the replacement of a 
threonine with a serine. Such replacements are referred to as conservative amino acid 
replacements, and all appropriate conservative amino acid replacements are considered to be 
embodiments of the invention. Insertions or deletions can optionally be in the range of about 1 to 
4, preferably 1 to 2, amino acids. It is generally preferable to maintain the "anchor positions" of 
the peptide which are responsible for binding to the MHC molecule in question. Indeed, 
immunogenicity of peptides can be improved in many cases by substituting more preferred residues 
at the anchor positions (Franco, et al., Nature Immunology, 1(2):145-150, 2000). Immunogenicity 
of a peptide can also often be improved by substituting bulkier amino acids for small amino acids 
found in non-anchor positions while maintaining sufficient cross-reactivity with the original 
epitope to constitute a useful vaccine. The variation allowed can be determined by routine 
insertions, deletions or substitutions of amino acids in the sequence and testing the resulting 
variants for activity exhibited by the polypeptide epitope. Because the polypeptide epitope is often 
9 amino acids, the substitutions preferably are made to the shortest active epitope, for example, an 
epitope of 9 amino acids. 
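
The routine substitution step described above can be sketched as a small enumeration. The code below generates every single conservative substitution of an epitope while leaving the anchor positions untouched. The grouping of residues into conservative classes and the choice of P2 and the C-terminal residue as anchors are assumptions made for illustration; the function name is hypothetical, and each resulting variant would still have to be tested for activity.

```python
# Assumed conservative classes (one common grouping, not from this text).
CONSERVATIVE_GROUPS = ["ST", "DE", "NQ", "KR", "ILVM", "FYW"]


def conservative_variants(epitope: str, anchor_positions=(2, -1)) -> list:
    """List single conservative substitutions at non-anchor positions.

    anchor_positions uses 1-based numbering from the N-terminus for
    positive values and counts from the C-terminus for negative values.
    """
    n = len(epitope)
    anchors = {p - 1 if p > 0 else n + p for p in anchor_positions}
    variants = []
    for i, aa in enumerate(epitope):
        if i in anchors:
            continue  # keep anchor residues as they are
        for group in CONSERVATIVE_GROUPS:
            if aa in group:
                for substitute in group:
                    if substitute != aa:
                        variants.append(epitope[:i] + substitute + epitope[i + 1:])
    return variants


for variant in conservative_variants("QLPEEKQPL"):
    print(variant)
```

Non-conservative substitutions, such as replacing a small residue with a bulkier one, could be enumerated the same way by supplying a different substitution table.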

Variants can also be made by adding any sequence onto the N-terminus of the polypeptide 
epitope variant. Such N-terminal additions can be from 1 amino acid up to at least 25 amino acids. 
Because peptide epitopes are often trimmed by N-terminal exopeptidases active in the pAPC, it is 
understood that variations in the added sequence can have no effect on the activity of the epitope. 
In preferred embodiments, the amino acid residues between the last upstream proteasomal cleavage 
site and the N-terminus of the MHC epitope do not include a proline residue (Serwold, T. et al., 
Nature Immunol. 2:644-651, 2001). Accordingly, effective epitopes can be generated from 
precursors larger than the preferred 9-mer class I motif. 

Peptides are useful to the extent that they correspond to epitopes actually displayed by 
MHC I on the surface of a target cell or a pAPC. A single peptide can have varying affinities for 
different MHC molecules, binding some well, others adequately, and still others not appreciably 
(Table 2). MHC alleles have traditionally been grouped according to serologic reactivity which 
does not reflect the structure of the peptide-binding groove, which can differ among different 
alleles of the same type. Similarly, binding properties can be shared across types; groups based on 
shared binding properties have been termed supertypes. There are numerous alleles of MHC I in 
the human population; epitopes specific to certain alleles can be selected based on the genotype of 
the patient. 

Table 2. 

Predicted Binding of Tyrosinase (SEQ ID NO. 1) to Various MHC types 

MHC I type                            *Half time of dissociation (min) 
A1                                    0.05 
A*0201                                1311 
A*0205                                50.4 
A3                                    2.7 
A*1101 (part of the A3 supertype)     0.012 
A24                                   6.0 
B7                                    4.0 
B8                                    8.0 
B14 (part of the B27 supertype)       60.0 
B*2702                                0.9 
B*2705                                30.0 
B*3501 (part of the B7 supertype)     2.0 
B*4403                                0.1 
B*5101 (part of the B7 supertype)     26.0 
B*5102                                55.0 
B*5801                                0.20 
B60                                   0.40 
B62                                   2.0 

*HLA Peptide Binding Predictions (internet http:// access at bimas.dcrt.nih.gov/molbio/hla_bin) 
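
To show how the predictions in Table 2 might be used when selecting epitopes for a patient's genotype, the sketch below ranks the MHC types by predicted half time of dissociation and applies a cutoff. The values are copied from Table 2; the 20 minute cutoff and the function name are arbitrary assumptions for illustration, not thresholds stated in this disclosure.

```python
# Predicted half time of dissociation (min) for SEQ ID NO. 1, from Table 2.
HALF_TIME_MIN = {
    "A1": 0.05, "A*0201": 1311.0, "A*0205": 50.4, "A3": 2.7,
    "A*1101": 0.012, "A24": 6.0, "B7": 4.0, "B8": 8.0, "B14": 60.0,
    "B*2702": 0.9, "B*2705": 30.0, "B*3501": 2.0, "B*4403": 0.1,
    "B*5101": 26.0, "B*5102": 55.0, "B*5801": 0.20, "B60": 0.40,
    "B62": 2.0,
}


def rank_alleles(half_times: dict, minimum: float = 0.0) -> list:
    """Return (MHC type, half time) pairs at or above the cutoff, best first."""
    hits = [(allele, t) for allele, t in half_times.items() if t >= minimum]
    return sorted(hits, key=lambda item: item[1], reverse=True)


# The 20 minute cutoff is a hypothetical value chosen for this example.
for allele, minutes in rank_alleles(HALF_TIME_MIN, minimum=20.0):
    print(f"{allele:8s} {minutes:8.1f}")
```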

In further embodiments of the invention, the epitope, as peptide or encoding 
polynucleotide, can be administered as a vaccine or immunogenic composition, alone or in 
combination with various adjuvants, carriers, or excipients. It should be noted that although the 
term vaccine may be used herein, the discussion can be applied and used with any of the other 
compositions mentioned herein. Particularly advantageous adjuvants include various cytokines and 
oligonucleotides containing immunostimulatory sequences (as set forth in greater detail in the 
co-pending applications referenced herein). Additionally the polynucleotide encoded epitope can be 
contained in a virus (e.g. vaccinia or adenovirus) or in a microbial host cell (e.g. Salmonella or 
Listeria monocytogenes) which is then used as a vector for the polynucleotide (Dietrich, G. et al. 
Nat. Biotech. 16:181-185, 1998). Alternatively a pAPC can be transformed, ex vivo, to express the 
epitope, or pulsed with peptide epitope, to be itself administered as a vaccine. To increase 
efficiency of these processes, the encoded epitope can be carried by a viral or bacterial vector, or 
complexed with a ligand of a receptor found on pAPC. Similarly the peptide epitope can be 
complexed with or conjugated to a pAPC ligand. A vaccine can be composed of more than a single 
epitope. 

Particularly advantageous strategies for incorporating epitopes, and combining them with 
epitope clusters, into a vaccine are disclosed in U.S. Patent Application No. 09/560,465 entitled 
"EPITOPE SYNCHRONIZATION IN ANTIGEN PRESENTING CELLS," filed on April 28, 
2000. Epitope clusters for use in connection with this invention are disclosed in U.S. Patent 
Application No. 09/561,571 entitled "EPITOPE CLUSTERS," filed on April 28, 2000. 

Preferred embodiments of the present invention are directed to vaccines and methods for 
causing a pAPC or population of pAPCs to present housekeeping epitopes that correspond to the 
epitopes displayed on a particular target cell. Any of the epitopes or antigens in Table 1A can be 
used, for example. In one embodiment, the housekeeping epitope is a TuAA epitope processed by 
the housekeeping proteasome of a particular tumor type. In another embodiment, the housekeeping 
epitope is a virus-associated epitope processed by the housekeeping proteasome of a cell infected 
with a virus. This facilitates a specific T cell response to the target cells. Concurrent expression 
by the pAPCs of multiple epitopes, corresponding to different induction states (pre- and 
post-attack), can drive a CTL response effective against target cells as they display either 
housekeeping epitopes or immune epitopes. 

By having both housekeeping and immune epitopes present on the pAPC, this embodiment 
can optimize the cytotoxic T cell response to a target cell. With dual epitope expression, the 
pAPCs can continue to sustain a CTL response to the immune-type epitope when the tumor cell 
switches from the housekeeping proteasome to the immune proteasome with induction by IFN, 
which, for example, may be produced by tumor-infiltrating CTLs. 

In a preferred embodiment, immunization of a patient is with a vaccine that includes a 
housekeeping epitope. Many preferred TAAs are associated exclusively with a target cell, 
particularly in the case of infected cells. In another embodiment, many preferred TAAs are the 
result of deregulated gene expression in transformed cells, but are found also in tissues of the testis, 
ovaries and fetus. In another embodiment, useful TAAs are expressed at higher levels in the target 
cell than in other cells. In still other embodiments, TAAs are not differentially expressed in the 
target cell compared to other cells, but are still useful since they are involved in a particular function 
of the cell and differentiate the target cell from most other peripheral cells; in such embodiments, 
healthy cells also displaying the TAA may be collaterally attacked by the induced T cell response, 
but such collateral damage is considered to be far preferable to the condition caused by the target 
cell. 

A preferred embodiment of the present invention includes a method of administering a 
vaccine including a housekeeping epitope to induce a therapeutic immune response. The vaccine is 
administered to a patient in a manner consistent with the standard vaccine delivery protocols that 
are well known in the art. Methods of administering epitopes of TAAs include, without limitation, 
transdermal, intranodal, perinodal, oral, intravenous, intradermal, intramuscular, intraperitoneal, 
and mucosal administration. A particularly useful method of vaccine delivery to elicit a CTL 
response is disclosed in PCT Publication No. WO 99/01283, entitled "A METHOD OF 
INDUCING A CTL RESPONSE," filed on July 10, 1998. 

Because the epitope synchronization system has utility in inducing a cell mediated immune 
response, a vaccine to induce a specific T cell response to a target cell is likewise included in a 
preferred embodiment of the present invention. The vaccine contains a housekeeping epitope in a 
concentration effective to cause a pAPC or populations of pAPCs to display housekeeping 
epitopes. Advantageously, the vaccine can include a plurality of housekeeping epitopes or one or 
more housekeeping epitopes optionally in combination with one or more immune epitopes. 
Formulations of the vaccine contain peptides and/or nucleic acids in a concentration sufficient to 
cause pAPCs to present the epitopes. The formulations preferably contain epitopes in a total 
concentration of about 1 µg-1 mg/100 µl of vaccine preparation. Conventional dosages and dosing 
for peptide vaccines and/or nucleic acid vaccines can be used with the present invention, and such 
dosing regimens are well understood in the art. In one embodiment, a single dosage for an adult 
human may advantageously be from about 1 to about 5000 µl of such a composition, administered 
one time or multiple times, e.g., in 2, 3, 4 or more dosages separated by 1 week, 2 weeks, 1 month, 
or more, insulin pump delivers 1 µl per hour (lowest frequency) ref intranodal method patent. 

The compositions and methods of the invention disclosed herein further contemplate 
incorporating adjuvants into the formulations in order to enhance the performance of the vaccines. 
Specifically, the addition of adjuvants to the formulations is designed to enhance the delivery or 
uptake of the epitopes by the pAPCs. The adjuvants contemplated by the present invention are 
known by those of skill in the art and include, for example, GMCSF, GCSF, IL-2, IL-12, BCG, 
tetanus toxoid, osteopontin, and ETA-1. 

In some embodiments of the invention, the vaccines can include a recombinant organism, 
such as a virus, bacterium or parasite, genetically engineered to express an epitope in a host. For 
example, Listeria monocytogenes, a gram-positive, facultative intracellular bacterium, is a potent 
vector for targeting TuAAs to the immune system. In a preferred embodiment, this vector can be 
engineered to express a housekeeping epitope to induce therapeutic responses. The normal route of 
infection of this organism is through the gut, and it can be delivered orally. In another embodiment, 
an adenovirus (Ad) vector encoding a housekeeping epitope for a TuAA can be used to induce 
anti-virus or anti-tumor responses. Bone marrow-derived dendritic cells can be transduced with the 
virus construct and then injected, or the virus can be delivered directly via subcutaneous injection 
into an animal to induce potent T-cell responses. Another embodiment employs a recombinant 
vaccinia virus engineered to encode amino acid sequences corresponding to a housekeeping epitope 
for a TAA. Vaccinia viruses carrying constructs with the appropriate nucleotide substitutions in 
the form of a minigene construct can direct the expression of a housekeeping epitope, leading to a 
therapeutic T cell response against the epitope. 

The immunization with DNA requires that APCs take up the DNA and express the encoded 
proteins or peptides. It is possible to encode a discrete class I peptide on the DNA. By 
immunizing with this construct, APCs can be caused to express a housekeeping epitope, which is 
then displayed on class I MHC on the surface of the cell for stimulating an appropriate CTL 
response. Constructs generally relying on termination of translation or non-proteasomal proteases 
for generation of proper termini of housekeeping epitopes have been described in U.S. Patent 
application No. 09/561,572 entitled EXPRESSION VECTORS ENCODING EPITOPES OF 
TARGET-ASSOCIATED ANTIGENS, filed on April 28, 2000. 

It can be desirable to express housekeeping peptides in the context of a larger protein. 
Processing can be detected even when a small number of amino acids are present beyond the 
terminus of an epitope. Small peptide hormones are usually proteolytically processed from longer 
translation products, often in the size range of approximately 60-120 amino acids. This fact has led 
some to assume that this is the minimum size that can be efficiently translated. In some 
embodiments, the housekeeping peptide can be embedded in a translation product of at least about 
60 amino acids. In other embodiments the housekeeping peptide can be embedded in a translation 
product of at least about 50, 30, or 15 amino acids. 

Due to differential proteasomal processing, the immune proteasome of the pAPC produces 
peptides that are different from those produced by the housekeeping proteasome in peripheral body 
cells. Thus, in expressing a housekeeping peptide in the context of a larger protein, it is preferably 
expressed in the APC in a context other than its full length native sequence, because, as a 
housekeeping epitope, it is generally only efficiently processed from the native protein by the 
housekeeping proteasome, which is not active in the APC. In order to encode the housekeeping 
epitope in a DNA sequence encoding a larger protein, it is useful to find flanking areas on either 
side of the sequence encoding the epitope that permit appropriate cleavage by the immune 
proteasome in order to liberate that housekeeping epitope. Such a sequence ensuring epitope 
synchronization is referred to hereinafter as a SYNCHROTOPE™. Altering flanking amino acid 
residues at the N-terminus and C-terminus of the desired housekeeping epitope can facilitate 
appropriate cleavage and generation of the housekeeping epitope in the APC. Sequences 
embedding housekeeping epitopes can be designed de novo and screened to determine which can 
be successfully processed by immune proteasomes to liberate housekeeping epitopes. 

Alternatively, another strategy is very effective for identifying sequences allowing 
production of housekeeping epitopes in APC. A contiguous sequence of amino acids can be 
generated from head to tail arrangement of one or more housekeeping epitopes. A construct 
expressing this sequence is used to immunize an animal, and the resulting T cell response is 
evaluated to determine its specificity to one or more of the epitopes in the array. By definition, 
these immune responses indicate housekeeping epitopes that are processed in the pAPC effectively. 
The necessary flanking areas around this epitope are thereby defined. The use of flanking regions 
of about 4-6 amino acids on either side of the desired peptide can provide the necessary 
information to facilitate proteasome processing of the housekeeping epitope by the immune 
proteasome. Therefore, a SYNCHROTOPE™ of approximately 16-22 amino acids can be inserted 
into, or fused to, any protein sequence effectively to result in that housekeeping epitope being 
produced in an APC. In alternate embodiments the whole head-to-tail array of epitopes, or just the 
epitopes immediately adjacent to the correctly processed housekeeping epitope can be similarly 
transferred from a test construct to a vaccine vector. 
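
The 16-22 amino acid figure follows from adding flanks of 4-6 residues on each side to an epitope of 8-10 residues. The sketch below reproduces that arithmetic and shows how an epitope with its flanks could be cut out of a head-to-tail array. The array is built from three peptides listed above purely as placeholders; whether any of them is a housekeeping epitope that is correctly processed is not implied, and all function names are hypothetical.

```python
def head_to_tail_array(epitopes: list) -> str:
    """Join epitopes directly, with no linker residues between them."""
    return "".join(epitopes)


def with_flanks(array: str, epitope: str, flank: int) -> str:
    """Return the epitope plus up to `flank` residues on either side."""
    start = array.find(epitope)
    if start < 0:
        raise ValueError("epitope not found in array")
    lo = max(0, start - flank)
    hi = min(len(array), start + len(epitope) + flank)
    return array[lo:hi]


def length_range(epitope_lengths=(8, 10), flank_lengths=(4, 6)) -> tuple:
    """Shortest and longest epitope-plus-flanks length."""
    return (epitope_lengths[0] + 2 * flank_lengths[0],
            epitope_lengths[1] + 2 * flank_lengths[1])


# Placeholder array: the middle peptide is the one to be excised.
array = head_to_tail_array(["QLPEEKQPL", "SLLCRHKRK", "LPEEKQPLL"])
segment = with_flanks(array, "SLLCRHKRK", flank=4)
print(segment, len(segment))
print(length_range())
```

The excised segment, rather than the bare epitope, is what would be moved from a test construct into a vaccine vector under this scheme.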

In a preferred embodiment, the housekeeping epitopes can be embedded between known 
immune epitopes, or segments of such, thereby providing an appropriate context for processing. 
The abutment of housekeeping and immune epitopes can generate the necessary context to enable 
the immune proteasome to liberate the housekeeping epitope, or a larger fragment, preferably 
including a correct C-terminus. It can be useful to screen constructs to verify that the desired 
epitope is produced. The abutment of housekeeping epitopes can generate a site cleavable by the 
immune proteasome. Some embodiments of the invention employ known epitopes to flank 
housekeeping epitopes in test substrates; in others, screening as described below is used whether 
the flanking regions are arbitrary sequences or mutants of the natural flanking sequence, and 
whether or not knowledge of proteasomal cleavage preferences is used in designing the 
substrates. 

Cleavage at the mature N-terminus of the epitope, while advantageous, is not required, 
since a variety of N-terminal trimming activities exist in the cell that can generate the mature 
N-terminus of the epitope subsequent to proteasomal processing. It is preferred that such N-terminal 
extension be less than about 25 amino acids in length and it is further preferred that the extension 
have few or no proline residues. Preferably, in screening, consideration is given not only to 
cleavage at the ends of the epitope (or at least at its C-terminus), but consideration also can be 
given to ensure limited cleavage within the epitope. 
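
The two preferences stated for an N-terminal extension, a length under about 25 residues and few or no prolines, can be expressed as a simple filter. In the sketch below the precursor is taken to be the fragment left after proteasomal processing, so everything upstream of the epitope is the extension. The default of zero permitted prolines is an assumption that interprets "few or no" strictly, and the function names are hypothetical.

```python
def n_terminal_extension(precursor: str, epitope: str) -> str:
    """Return the residues that precede the epitope in the precursor."""
    start = precursor.find(epitope)
    if start < 0:
        raise ValueError("epitope not found in precursor")
    return precursor[:start]


def extension_is_preferred(extension: str, max_length: int = 25,
                           max_prolines: int = 0) -> bool:
    """Apply the length and proline preferences to an extension."""
    return len(extension) < max_length and extension.count("P") <= max_prolines


# SEQ ID NO. 585 treated as a precursor of the nested 8-mer LPEEKQPL.
extension = n_terminal_extension("KQLPEEKQPL", "LPEEKQPL")
print(repr(extension), extension_is_preferred(extension))
```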

Shotgun approaches can be used in designing test substrates and can increase the efficiency 
of screening. In one embodiment multiple epitopes can be assembled one after the other, with 
individual epitopes possibly appearing more than once. The substrate can be screened to determine 
which epitopes can be produced. In the case where a particular epitope is of concern a substrate can 
be designed in which it appears in multiple different contexts. When a single epitope appearing in 
more than one context is liberated from the substrate, additional secondary test substrates, in which 
individual instances of the epitope are removed, disabled, or are unique, can be used to determine 
which are being liberated and truly constitute SYNCHROTOPE™s. 
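
The design of secondary test substrates can be sketched as follows: locate every instance of an epitope in a primary substrate, then derive one secondary substrate per instance in which all other instances are removed. The primary substrate here uses arbitrary flanking residues and is purely hypothetical, as are the function names; deletion is only one of the options named above (disabling an instance by mutation would be another).

```python
def find_occurrences(substrate: str, epitope: str) -> list:
    """Return the 0-based start index of every instance of the epitope."""
    hits = []
    start = substrate.find(epitope)
    while start != -1:
        hits.append(start)
        start = substrate.find(epitope, start + 1)
    return hits


def single_instance_substrates(substrate: str, epitope: str) -> list:
    """For each instance, build a substrate that keeps only that instance."""
    hits = find_occurrences(substrate, epitope)
    size = len(epitope)
    if any(b - a < size for a, b in zip(hits, hits[1:])):
        raise ValueError("overlapping instances are not handled in this sketch")
    variants = []
    for keep in range(len(hits)):
        pieces, previous_end = [], 0
        for index, hit in enumerate(hits):
            pieces.append(substrate[previous_end:hit])
            if index == keep:
                pieces.append(epitope)  # retain only the chosen instance
            previous_end = hit + size
        pieces.append(substrate[previous_end:])
        variants.append("".join(pieces))
    return variants


epitope = "SLLCRHKRK"
primary = "AAAA" + epitope + "GGGG" + epitope + "KKKK"  # hypothetical contexts
print(find_occurrences(primary, epitope))
for secondary in single_instance_substrates(primary, epitope):
    print(secondary)
```

Note that deleting an instance shifts the distal context of the retained one, so results from secondary substrates are best read alongside those from the primary substrate.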

Several readily practicable screens exist. A preferred in vitro screen utilizes proteasomal 
digestion analysis, using purified immune proteasomes, to determine if the desired housekeeping 
epitope can be liberated from a synthetic peptide embodying the sequence in question. The position 
of the cleavages obtained can be determined by techniques such as mass spectrometry, HPLC, and 
N-terminal pool sequencing, as described in greater detail in the U.S. Patent Applications entitled 
METHOD OF EPITOPE DISCOVERY and EPITOPE SYNCHRONIZATION IN ANTIGEN 
PRESENTING CELLS, and the two Provisional U.S. Patent Applications entitled EPITOPE 
SEQUENCES, which are all cited and incorporated by reference above. 
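
Once digest fragments have been identified, for example by mass spectrometry, interpreting them comes down to mapping each fragment back onto the substrate and asking where the cuts fall relative to the epitope. The sketch below does this for fragment sequences supplied as strings. The substrate and fragment lists are invented for illustration and are not experimental results; the function names are hypothetical, and each fragment is assumed to match the substrate at a single location.

```python
def cleavage_boundaries(substrate: str, fragments: list) -> set:
    """Return cut positions b, meaning a cut between residues b-1 and b."""
    boundaries = set()
    for fragment in fragments:
        start = substrate.find(fragment)
        if start < 0:
            raise ValueError(f"{fragment!r} is not part of the substrate")
        end = start + len(fragment)
        if start > 0:
            boundaries.add(start)
        if end < len(substrate):
            boundaries.add(end)
    return boundaries


def assess_epitope(substrate: str, epitope: str, fragments: list) -> dict:
    """Summarize cuts at the epitope termini and within the epitope."""
    start = substrate.find(epitope)
    if start < 0:
        raise ValueError("epitope not found in substrate")
    end = start + len(epitope)
    cuts = cleavage_boundaries(substrate, fragments)
    return {
        "c_terminus_liberated": end in cuts,
        "n_terminus_liberated": start in cuts,
        "internal_cuts": sorted(c for c in cuts if start < c < end),
    }


substrate = "KQPLSLLCRHKRKLPEE"   # hypothetical synthetic peptide
epitope = "SLLCRHKRK"
print(assess_epitope(substrate, epitope, ["KQPLSLLCRHKRK", "LPEE"]))
print(assess_epitope(substrate, epitope, ["KQPLSLLC", "RHKRKLPEE"]))
```

A favorable digest in this scheme shows a cut at the epitope's C-terminus and few or no internal cuts; a missing N-terminal cut is tolerable because N-terminal trimming can occur after proteasomal processing.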

Alternatively, in vivo screens such as immunization or target sensitization can be 
employed. For immunization a nucleic acid construct capable of expressing the sequence in 
question is used. Harvested CTL can be tested for their ability to recognize target cells presenting 
the housekeeping epitope in question. Such target cells are most readily obtained by pulsing cells 
expressing the appropriate MHC molecule with synthetic peptide embodying the mature 
housekeeping epitope. Alternatively, cells known to express housekeeping proteasome and the 
antigen from which the housekeeping epitope is derived, either endogenously or through genetic 
engineering, can be used. To use target sensitization as a screen, CTL, or preferably a CTL clone, 
that recognizes the housekeeping epitope can be used. In this case it is the target cell that expresses 
the embedded housekeeping epitope (instead of the pAPC during immunization) and it must 
express immune proteasome. Generally, the target cell can be transformed with an appropriate 
nucleic acid construct to confer expression of the embedded housekeeping epitope. Loading with a 
synthetic peptide embodying the embedded epitope using peptide loaded liposomes or a protein 
transfer reagent such as BIOPORTER™ (Gene Therapy Systems, San Diego, CA) represents an 
alternative. 

Additional guidance on nucleic acid constructs useful as vaccines in accordance with the 
present invention is disclosed in U.S. Patent Application No. 09/561,572 entitled "EXPRESSION 
VECTORS ENCODING EPITOPES OF TARGET-ASSOCIATED ANTIGENS," filed on April 
28, 2000. Further, expression vectors and methods for their design, which are useful in accordance 
with the present invention, are disclosed in U.S. Patent Application No. 60/336,968 (attorney 
docket number CTLIMM.022PR) entitled "EXPRESSION VECTORS ENCODING EPITOPES 
OF TARGET-ASSOCIATED ANTIGENS AND METHODS FOR THEIR DESIGN," filed on 
11/7/2001, which is incorporated by reference in its entirety. 

A preferred embodiment of the present invention includes a method of administering a 
vaccine including an epitope (or epitopes) to induce a therapeutic immune response. The vaccine 
is administered to a patient in a manner consistent with the standard vaccine delivery protocols that 
are known in the art. Methods of administering epitopes of TAAs include, without limitation, 
transdermal, intranodal, perinodal, oral, intravenous, intradermal, intramuscular, intraperitoneal, 
and mucosal administration, including delivery by injection, instillation or inhalation. A 
particularly useful method of vaccine delivery to elicit a CTL response is disclosed in Australian 
Patent No. 739189 issued January 17, 2002; U.S. Patent Application No. 09/380,534, filed on 
September 1, 1999; and a Continuation-in-Part thereof U.S. Patent Application No. 09/776,232 
both entitled "A METHOD OF INDUCING A CTL RESPONSE," filed on February 2, 2001. 

Reagents Recognizing Epitopes 

In another aspect of the invention, proteins with binding specificity for the epitope and the 
epitope-MHC molecule complex are contemplated, as well as the isolated cells by which they can 
be expressed. In one set of embodiments these reagents take the form of immunoglobulins: 
polyclonal sera or monoclonal antibodies (mAb), methods for the generation of which are well 
known in the art. Generation of mAb with specificity for peptide-MHC molecule complexes is 
known in the art. See, for example, Aharoni et al. Nature 351:147-150, 1991; Andersen et al. Proc. 
Natl. Acad. Sci. USA 93:1820-1824, 1996; Dadaglio et al. Immunity 6:727-738, 1997; Duc et al. 
Int. Immunol. 5:427-431, 1993; Eastman et al. Eur. J. Immunol. 26:385-393, 1996; Engberg et al. 
Immunotechnology 4:273-278, 1999; Porgador et al. Immunity 6:715-726, 1997; Puri et al. J. 
Immunol. 158:2471-2476, 1997; and Polakova, K., et al. J. Immunol. 165:342-348, 2000; all of 
which are hereby incorporated by reference in their entirety. 

In other embodiments the compositions can be used to induce and generate, in vivo and in 
vitro, T-cells specific for any of the epitopes, including those listed in Table 1A, for example. 
Thus, embodiments also relate to and include isolated T cells, T cell clones, T cell hybridomas, or a 
protein containing the T cell receptor (TCR) binding domain derived from the cloned gene, as well 
as a recombinant cell expressing such a protein. Such TCR derived proteins can be simply the 
extra-cellular domains of the TCR, or a fusion with portions of another protein to confer a desired 
property or function. One example of such a fusion is the attachment of TCR binding domains to 
the constant regions of an antibody molecule so as to create a divalent molecule. The construction 
and activity of molecules following this general pattern have been reported, for example, Plaksin, 
D. et al. J. Immunol. 158:2218-2227, 1997 and Lebowitz, M.S. et al. Cell Immunol. 192:175-184, 
1999, which are hereby incorporated by reference in their entirety. The more general construction 
and use of such molecules is also treated in U.S. patent 5,830,755 entitled T CELL RECEPTORS 
AND THEIR USE IN THERAPEUTIC AND DIAGNOSTIC METHODS, which is hereby 
incorporated by reference in its entirety. 

The generation of such T cells can be readily accomplished by standard immunization of
laboratory animals, and reactivity to human target cells can be obtained by immunizing with human
target cells or by immunizing HLA-transgenic animals with the antigen/epitope. For some
therapeutic approaches T cells derived from the same species are desirable. While such a cell can
be created by cloning, for example, a murine TCR into a human T cell as contemplated above, in
vitro immunization of human cells offers a potentially faster option. Techniques for in vitro
immunization, even using naive donors, are known in the field; see, for example, Stauss et al., Proc.
Natl. Acad. Sci. USA 89:7871-7875, 1992; Salgaller et al. Cancer Res. 55:4972-4979, 1995; Tsai et
al., J. Immunol. 158:1796-1802, 1997; and Chung et al., J. Immunother. 22:279-287, 1999; which
are hereby incorporated by reference in their entirety.

Any of these molecules can be conjugated to enzymes, radiochemicals, fluorescent tags,
and toxins, so as to be used in the diagnosis (imaging or other detection), monitoring, and treatment
of the pathogenic condition associated with the epitope. Thus a toxin conjugate can be
administered to kill tumor cells, radiolabeling can facilitate imaging of epitope-positive tumor, and an
enzyme conjugate can be used in an ELISA-like assay to diagnose cancer and confirm epitope
expression in biopsied tissue. In a further embodiment, such T cells as set forth above, following
expansion accomplished through stimulation with the epitope and/or cytokines, can be
administered to a patient as an adoptive immunotherapy.
Reagents Comprising Epitopes 

A further aspect of the invention provides isolated epitope-MHC complexes. In a
particularly advantageous embodiment of this aspect of the invention, the complexes can be
soluble, multimeric proteins such as those described in U.S. Patent No. 5,635,363 (tetramers) or
U.S. Patent No. 6,015,884 (Ig-dimers), both of which are hereby incorporated by reference in their
entirety. Such reagents are useful in detecting and monitoring specific T cell responses, and in
purifying such T cells.

Isolated MHC molecules complexed with epitopic peptides can also be incorporated into
planar lipid bilayers or liposomes. Such compositions can be used to stimulate T cells in vitro or, in
the case of liposomes, in vivo. Co-stimulatory molecules (e.g. B7, CD40, LFA-3) can be
incorporated into the same compositions or, especially for in vitro work, co-stimulation can be
provided by anti-co-receptor antibodies (e.g. anti-CD28, anti-CD154, anti-CD2) or cytokines (e.g.
IL-2, IL-12). Such stimulation of T cells can constitute vaccination, drive expansion of T cells in
vitro for subsequent infusion in an immunotherapy, or constitute a step in an assay of T cell
function.

The epitope, or more directly its complex with an MHC molecule, can be an important
constituent of functional assays of antigen-specific T cells at either an activation or readout step or
both. Of the many assays of T cell function current in the art (detailed procedures can be found in
standard immunological references such as Current Protocols in Immunology 1999 John Wiley &
Sons Inc., N.Y., which is hereby incorporated by reference in its entirety) two broad classes can be
defined: those that measure the response of a pool of cells and those that measure the response of
individual cells. Whereas the former conveys a global measure of the strength of a response, the
latter allows determination of the relative frequency of responding cells. Examples of assays
measuring global response are cytotoxicity assays, ELISA, and proliferation assays detecting
cytokine secretion. Assays measuring the responses of individual cells (or small clones derived
from them) include limiting dilution analysis (LDA), ELISPOT, flow cytometric detection of
unsecreted cytokine (described in U.S. Patent No. 5,445,939, entitled "METHOD FOR
ASSESSMENT OF THE MONONUCLEAR LEUKOCYTE IMMUNE SYSTEM" and U.S. Patent
Nos. 5,656,446 and 5,843,689, both entitled "METHOD FOR THE ASSESSMENT OF THE
MONONUCLEAR LEUKOCYTE IMMUNE SYSTEM," reagents for which are sold by Becton,
Dickinson & Company under the tradename 'FASTIMMUNE', which patents are hereby
incorporated by reference in their entirety) and detection of specific TCR with tetramers or
Ig-dimers as stated and referenced above. The comparative virtues of these techniques have been
reviewed in Yee, C. et al. Current Opinion in Immunology, 13:141-146, 2001, which is hereby
incorporated by reference in its entirety. Additionally, detection of a specific TCR rearrangement
or expression can be accomplished through a variety of established nucleic acid based techniques,
particularly in situ and single-cell PCR techniques, as will be apparent to one of skill in the art.

These functional assays are used to assess endogenous levels of immunity, response to an
immunologic stimulus (e.g. a vaccine), and to monitor immune status through the course of a
disease and treatment. Except when measuring endogenous levels of immunity, any of these assays
presume a preliminary step of immunization, whether in vivo or in vitro depending on the nature of
the issue being addressed. Such immunization can be carried out with the various embodiments of
the invention described above or with other forms of immunogen (e.g., pAPC-tumor cell fusions)
that can provoke similar immunity. With the exception of PCR and tetramer/Ig-dimer type
analyses, which can detect expression of the cognate TCR, these assays generally benefit from a
step of in vitro antigenic stimulation, which can advantageously use various embodiments of the
invention as described above in order to detect the particular functional activity (highly cytolytic
responses can sometimes be detected directly). Finally, detection of cytolytic activity requires
epitope-displaying target cells, which can be generated using various embodiments of the
invention. The particular embodiment chosen for any particular step depends on the question to be
addressed, ease of use, cost, and the like, but the advantages of one embodiment over another for
any particular set of circumstances will be apparent to one of skill in the art.
Tumor Associated Antigens

Epitopes of the present invention are derived from the TuAAs tyrosinase (SEQ ID NO. 2),
SSX-2 (SEQ ID NO. 3), PSMA (prostate-specific membrane antigen) (SEQ ID NO. 4), GP100
(SEQ ID NO. 70), MAGE-1 (SEQ ID NO. 71), MAGE-2 (SEQ ID NO. 72), MAGE-3 (SEQ ID
NO. 73), NY-ESO-1 (SEQ ID NO. 74), PRAME (SEQ ID NO. 77), PSA (SEQ ID NO. 78), and
PSCA (SEQ ID NO. 79). The natural coding sequences for these eleven proteins, or any segments
within them, can be determined from their cDNA or complete coding (cds) sequences, SEQ ID
NOS. 5-7, and 80-87, respectively.

Tyrosinase is a melanin biosynthetic enzyme that is considered one of the most specific
markers of melanocytic differentiation. Tyrosinase is expressed in few cell types, primarily in
melanocytes, and high levels are often found in melanomas. The usefulness of tyrosinase as a
TuAA is taught in U.S. Patent 5,747,271 entitled "METHOD FOR IDENTIFYING INDIVIDUALS
SUFFERING FROM A CELLULAR ABNORMALITY SOME OF WHOSE ABNORMAL
CELLS PRESENT COMPLEXES OF HLA-A2/TYROSINASE DERIVED PEPTIDES, AND
METHODS FOR TREATING SAID INDIVIDUALS," which is hereby incorporated by reference
in its entirety.

GP100, also known as PMel17, is also a melanin biosynthetic protein expressed at high
levels in melanomas. GP100 as a TuAA is disclosed in U.S. Patent 5,844,075 entitled
"MELANOMA ANTIGENS AND THEIR USE IN DIAGNOSTIC AND THERAPEUTIC
METHODS," which is hereby incorporated by reference in its entirety.

SSX-2, also known as Hom-Mel-40, is a member of a family of highly conserved cancer-testis
antigens (Gure, A.O. et al. Int. J. Cancer 72:965-971, 1997, which is hereby incorporated by
reference in its entirety). Its identification as a TuAA is taught in U.S. Patent 6,025,191 entitled
"ISOLATED NUCLEIC ACID MOLECULES WHICH ENCODE A MELANOMA SPECIFIC
ANTIGEN AND USES THEREOF," which is hereby incorporated by reference in its entirety.
Cancer-testis antigens are found in a variety of tumors, but are generally absent from normal adult
tissues except testis. Expression of different members of the SSX family has been found
variously in tumor cell lines. Due to the high degree of sequence identity among SSX family
members, similar epitopes from more than one member of the family will be generated and able to
bind to an MHC molecule, so that some vaccines directed against one member of this family can
cross-react and be effective against other members of this family (see Example 3 below).

MAGE-1, MAGE-2, and MAGE-3 are members of another family of cancer-testis antigens
originally discovered in melanoma (MAGE is a contraction of melanoma-associated antigen) but
found in a variety of tumors. The identification of MAGE proteins as TuAAs is taught in U.S.
Patent 5,342,774 entitled "NUCLEOTIDE SEQUENCE ENCODING THE TUMOR REJECTION
ANTIGEN PRECURSOR, MAGE-1," which is hereby incorporated by reference in its entirety, and
in numerous subsequent patents. Currently there are 17 entries for (human) MAGE in the SWISS
Protein database. There is extensive similarity among these proteins, so in many cases an epitope
from one can induce a cross-reactive response to other members of the family. A few of these have
not been observed in tumors, most notably MAGE-H1 and MAGE-D1, which are expressed in
testes and brain, and bone marrow stromal cells, respectively. The possibility of cross-reactivity on
normal tissue is ameliorated by the fact that they are among the least similar to the other MAGE
proteins.

NY-ESO-1 is a cancer-testis antigen found in a wide variety of tumors, also known as
CTAG-1 (Cancer-Testis Antigen-1) and CAG-3 (Cancer Antigen-3). NY-ESO-1 as a TuAA is
disclosed in U.S. Patent 5,804,381 entitled "ISOLATED NUCLEIC ACID MOLECULE
ENCODING AN ESOPHAGEAL CANCER ASSOCIATED ANTIGEN, THE ANTIGEN ITSELF,
AND USES THEREOF," which is hereby incorporated by reference in its entirety. A paralogous
locus encoding antigens with extensive sequence identity, LAGE-1a/s (SEQ ID NO. 75) and
LAGE-1b/L (SEQ ID NO. 76), has been disclosed in publicly available assemblies of the human
genome, and these antigens have been concluded to arise through alternate splicing. Additionally,
CT-2 (or CTAG-2, Cancer-Testis Antigen-2) appears to be either an allele, a mutant, or a sequencing
discrepancy of LAGE-1b/L. Due to the extensive sequence identity, many epitopes from NY-ESO-1
can also induce immunity to tumors expressing these other antigens. See Figure 1. The proteins
are virtually identical through amino acid 70. From 71-134 the longest run of identities between
NY-ESO-1 and LAGE is 6 residues, but potentially cross-reactive sequences are present. And
from 135-180 NY-ESO-1 and LAGE-1a/s are identical except for a single residue, but LAGE-1b/L is
unrelated due to the alternate splice. The CAMEL and LAGE-2 antigens appear to derive from the
LAGE-1 mRNA, but from alternate reading frames, thus giving rise to unrelated protein sequences.
More recently, GenBank Accession AF277315.5, Homo sapiens chromosome X clone RP5-865E18,
RP5-1087L19, complete sequence, reports three independent loci in this region which are
labeled as LAGE1 (corresponding to CTAG-2 in the genome assemblies), plus LAGE2-A and
LAGE2-B (both corresponding to CTAG-1 in the genome assemblies).

PSMA (prostate-specific membrane antigen), a TuAA described in U.S. Patent 5,538,866
entitled "PROSTATE-SPECIFIC MEMBRANE ANTIGEN," which is hereby incorporated by
reference in its entirety, is expressed by normal prostate epithelium and, at a higher level, in
prostatic cancer. It has also been found in the neovasculature of non-prostatic tumors. PSMA can
thus form the basis for vaccines directed to both prostate cancer and to the neovasculature of other
tumors. This latter concept is more fully described in a provisional U.S. Patent Application No.
60/274,063 entitled "ANTI-NEOVASCULAR VACCINES FOR CANCER," filed March 7, 2001,
and U.S. Application No. __/___, attorney docket number CTLIMM.015A, filed on March 7,
2002, entitled "ANTI-NEOVASCULAR PREPARATIONS FOR CANCER," both of which are
hereby incorporated by reference in their entirety. Alternate splicing of the PSMA mRNA also
leads to a protein with an apparent start at Met58, thereby deleting the putative membrane anchor
region of PSMA as described in U.S. Patent 5,935,818 entitled "ISOLATED NUCLEIC ACID
MOLECULE ENCODING ALTERNATIVELY SPLICED PROSTATE-SPECIFIC
MEMBRANE ANTIGEN AND USES THEREOF," which is hereby incorporated by reference in
its entirety. A protein termed PSMA-like protein, GenBank accession number AF261715, is nearly
identical to amino acids 309-750 of PSMA and has a different expression profile. Thus the most
preferred epitopes are those with an N-terminus located from amino acid 58 to 308.

PRAME, also known as MAPE, DAGE, and OIP4, was originally observed as a melanoma
antigen. Subsequently, it has been recognized as a CT antigen, but unlike many CT antigens (e.g.,
MAGE, GAGE, and BAGE) it is expressed in acute myeloid leukemias. PRAME is a member of
the MAPE family, which consists largely of hypothetical proteins with which it shares limited
sequence similarity. The usefulness of PRAME as a TuAA is taught in U.S. Patent 5,830,753
entitled "ISOLATED NUCLEIC ACID MOLECULES CODING FOR TUMOR REJECTION
ANTIGEN PRECURSOR DAGE AND USES THEREOF," which is hereby incorporated by
reference in its entirety.

PSA, prostate specific antigen, is a peptidase of the kallikrein family and a differentiation
antigen of the prostate. Expression in breast tissue has also been reported. Alternate names include
gamma-seminoprotein, kallikrein 3, seminogelase, seminin, and P-30 antigen. PSA has a high
degree of sequence identity with the various alternate splicing products prostatic/glandular
kallikrein-1 and -2, as well as kallikrein 4, which is also expressed in prostate and breast tissue.
Other kallikreins generally share less sequence identity and have different expression profiles.
Nonetheless, cross-reactivity that might be provoked by any particular epitope, along with the
likelihood that that epitope would be liberated by processing in non-target tissues (most generally
by the housekeeping proteasome), should be considered in designing a vaccine.

PSCA, prostate stem cell antigen, also known as SCAH-2, is a differentiation antigen
preferentially expressed in prostate epithelial cells, and overexpressed in prostate cancers. Lower
level expression is seen in some normal tissues including neuroendocrine cells of the digestive tract
and collecting ducts of the kidney. PSCA is described in U.S. Patent 5,856,136 entitled "HUMAN
STEM CELL ANTIGENS," which is hereby incorporated by reference in its entirety.

Synaptonemal complex protein 1 (SCP-1), also known as HOM-TES-14, is a meiosis-associated
protein and also a cancer-testis antigen (Tureci, O., et al. Proc. Natl. Acad. Sci. USA
95:5211-5216, 1998). As a cancer antigen its expression is not cell-cycle regulated and it is found
frequently in gliomas, breast, renal cell, and ovarian carcinomas. It has some similarity to myosins,
but with few enough identities that cross-reactive epitopes are not an immediate prospect.

The ED-B domain of fibronectin is also a potential target. Fibronectin is subject to
developmentally regulated alternative splicing, with the ED-B domain being encoded by a single
exon that is used primarily in oncofetal tissues (Matsuura, H. and S. Hakomori Proc. Natl. Acad.
Sci. USA 82:6517-6521, 1985; Carnemolla, B. et al. J. Cell Biol. 108:1139-1148, 1989;
Loridon-Rosa, B. et al. Cancer Res. 50:1608-1612, 1990; Nicolo, G. et al. Cell Differ. Dev. 32:401-408,
1990; Borsi, L. et al. Exp. Cell Res. 199:98-105, 1992; Oyama, F. et al. Cancer Res. 53:2005-2011,
1993; Mandel, U. et al. APMIS 102:695-702, 1994; Farnoud, M.R. et al. Int. J. Cancer 61:27-34,
1995; Pujuguet, P. et al. Am. J. Pathol. 148:579-592, 1996; Gabler, U. et al. Heart 75:358-362,
1996; Chevalier, X. Br. J. Rheumatol. 35:407-415, 1996; Midulla, M. Cancer Res. 60:164-169,
2000).

The ED-B domain is also expressed in fibronectin of the neovasculature (Kaczmarek, J. et
al. Int. J. Cancer 59:11-16, 1994; Castellani, P. et al. Int. J. Cancer 59:612-618, 1994; Neri, D. et
al. Nat. Biotech. 15:1271-1275, 1997; Karelina, T.V. and A.Z. Eisen Cancer Detect. Prev.
22:438-444, 1998; Tarli, L. et al. Blood 94:192-198, 1999; Castellani, P. et al. Acta Neurochir. (Wien)
142:277-282, 2000). As an oncofetal domain, the ED-B domain is commonly found in the
fibronectin expressed by neoplastic cells in addition to being expressed by the neovasculature.
Thus, CTL-inducing vaccines targeting the ED-B domain can exhibit two mechanisms of action:
direct lysis of tumor cells, and disruption of the tumor's blood supply through destruction of the
tumor-associated neovasculature. As CTL activity can decay rapidly after withdrawal of vaccine,
interference with normal angiogenesis can be minimal. The design and testing of vaccines targeted
to neovasculature is described in Provisional U.S. Patent Application No. 60/274,063 entitled
"ANTI-NEOVASCULATURE VACCINES FOR CANCER" and in U.S. Patent Application No.
__/___, attorney docket number CTLIMM.015A, entitled "ANTI-NEOVASCULATURE
PREPARATIONS FOR CANCER," filed on date even with this application (March 7, 2002). A
tumor cell line is disclosed in Provisional U.S. Application No. __/___, filed on March 7, 2002,
attorney docket number CTLIMM.028PR, entitled "HLA-TRANSGENIC MURINE TUMOR
CELL LINE," which is hereby incorporated by reference in its entirety.

Carcinoembryonic antigen (CEA) is a paradigmatic oncofetal protein first described in
1965 (Gold and Freedman, J. Exp. Med. 121:439-462, 1965. Fuller references can be found in the
Online Mendelian Inheritance in Man; record *114890). It has officially been renamed
carcinoembryonic antigen-related cell adhesion molecule 5 (CEACAM5). Its expression is most
strongly associated with adenocarcinomas of the epithelial lining of the digestive tract and in fetal
colon. CEA is a member of the immunoglobulin supergene family and the defining member of the
CEA subfamily.

HER2/NEU is an oncogene related to the epidermal growth factor receptor (van de Vijver,
et al., New Eng. J. Med. 319:1239-1245, 1988), and apparently identical to the C-ERBB2 oncogene
(Di Fiore, et al., Science 237:178-182, 1987). The over-expression of ERBB2 has been implicated
in the neoplastic transformation of prostate cancer. As HER2 it is amplified and over-expressed in
25-30% of breast cancers, among other tumors, where expression level is correlated with the
aggressiveness of the tumor (Slamon, et al., New Eng. J. Med. 344:783-792, 2001). A more detailed
description is available in the Online Mendelian Inheritance in Man; record *164870.

All references mentioned herein are hereby incorporated by reference in their entirety.
Further, incorporated by reference in its entirety is U.S. Patent Application No. 10/005,905
(attorney docket number CTLIMM.021CP1) entitled "EPITOPE SYNCHRONIZATION IN
ANTIGEN PRESENTING CELLS," filed on November 7, 2001 and a continuation thereof, U.S.
Application No. __/___, filed on December 7, 2000, attorney docket number
CTLIMM.21CP1C, also entitled "EPITOPE SYNCHRONIZATION IN ANTIGEN PRESENTING
CELLS."

Useful epitopes were identified and tested as described in the following examples.
However, these examples are intended for illustration purposes only, and should not be construed
as limiting the scope of the invention in any way.

EXAMPLES 

Sequences of Specific Preferred Epitopes 
Example 1 

Manufacture of tyrosinase epitopes.

A. Synthetic production of epitopes 

Peptides having an amino acid sequence of any of SEQ ID NO: 1, 8, 9, 11-23, 2-29, 32-44,
47-54, 56-63, 66-68, 88-253, or 256-588 are synthesized using either FMOC or tBOC solid phase
synthesis methodologies. After synthesis, the peptides are cleaved from their supports with either
trifluoroacetic acid or hydrogen fluoride, respectively, in the presence of appropriate protective
scavengers. After removing the acid by evaporation, the peptides are extracted with ether to
remove the scavengers and the crude, precipitated peptide is then lyophilized. Purity of the crude
peptides is determined by HPLC, sequence analysis, amino acid analysis, counterion content
analysis and other suitable means. If the crude peptides are pure enough (greater than or equal to
about 90% pure), they can be used as is. If purification is required to meet drug substance
specifications, the peptides are purified using one or a combination of the following:
re-precipitation; reverse-phase, ion exchange, size exclusion or hydrophobic interaction
chromatography; or counter-current distribution.
Drug product formulation 

GMP-grade peptides are formulated in a parenterally acceptable aqueous, organic, or
aqueous-organic buffer or solvent system in which they remain both physically and chemically
stable and biologically potent. Generally, buffers or combinations of buffers or combinations of
buffers and organic solvents are appropriate. The pH range is typically between 6 and 9. Organic
modifiers or other excipients can be added to help solubilize and stabilize the peptides. These
include detergents, lipids, co-solvents, antioxidants, chelators and reducing agents. In the case of a
lyophilized product, sucrose or mannitol or other lyophilization aids can be added. Peptide
solutions are sterilized by membrane filtration into their final container-closure system and either
lyophilized for dissolution in the clinic, or stored until use.

B. Construction of expression vectors for use as nucleic acid vaccines 

The construction of three generic epitope expression vectors is presented below. The
particular advantages of these designs are set forth in U.S. Patent Application No. 09/561,572
entitled "EXPRESSION VECTORS ENCODING EPITOPES OF TARGET-ASSOCIATED
ANTIGENS," which has been incorporated by reference in its entirety above.

A suitable E. coli strain was then transfected with the plasmid and plated out onto a
selective medium. Several colonies were grown up in suspension culture and positive clones were
identified by restriction mapping. The positive clone was then grown up and aliquotted into
storage vials and stored at -70°C.

A mini-prep (QIAprep Spin Mini-prep: Qiagen, Valencia, CA) of the plasmid was then
made from a sample of these cells and automated fluorescent dideoxy sequence analysis was used
to confirm that the construct had the desired sequence.
B.1 Construction of pVAX-EP1-IRES-EP2

Overview: 

The starting plasmid for this construct is pVAX1 purchased from Invitrogen (Carlsbad,
CA). Epitopes EP1 and EP2 were synthesized by GIBCO BRL (Rockville, MD). The IRES was
excised from pIRES purchased from Clontech (Palo Alto, CA).
Procedure:

1. pIRES was digested with EcoRI and NotI. The digested fragments were separated by
agarose gel electrophoresis, and the IRES fragment was purified from the excised band.

2. pVAX1 was digested with EcoRI and NotI, and the pVAX1 fragment was gel-purified.

3. The purified pVAX1 and IRES fragments were then ligated together.

4. Competent E. coli of strain DH5α were transformed with the ligation mixture.

5. Minipreps were made from 4 of the resultant colonies.

6. Restriction enzyme digestion analysis was performed on the miniprep DNA. One
recombinant colony having the IRES insert was used for further insertion of EP1 and EP2.
This intermediate construct was called pVAX-IRES.

7. Oligonucleotides encoding EP1 and EP2 were synthesized.

8. EP1 was subcloned into pVAX-IRES between the AflII and EcoRI sites, to make
pVAX-EP1-IRES.

9. EP2 was subcloned into pVAX-EP1-IRES between the SalI and NotI sites, to make the
final construct pVAX-EP1-IRES-EP2.

10. The sequence of the EP1-IRES-EP2 insert was confirmed by DNA sequencing.

B.2 Construction of pVAX-EP1-IRES-EP2-ISS-NIS
Overview: 

The starting plasmid for this construct was pVAX-EP1-IRES-EP2 (Example 1). The ISS
(immunostimulatory sequence) introduced into this construct is AACGTT, and the NIS (standing
for nuclear import sequence) used is the SV40 72bp repeat sequence. ISS-NIS was synthesized by
GIBCO BRL. See Figure 2.
Procedure:

1. pVAX-EP1-IRES-EP2 was digested with NruI; the linearized plasmid was gel-purified.

2. ISS-NIS oligonucleotide was synthesized.

3. The purified linearized pVAX-EP1-IRES-EP2 and synthesized ISS-NIS were ligated
together.

4. Competent E. coli of strain DH5α were transformed with the ligation product.

5. Minipreps were made from resultant colonies.

6. Restriction enzyme digestions of the minipreps were carried out.

7. The plasmid with the insert was sequenced.
B.3 Construction of pVAX-EP2-UB-EP1

Overview: 

The starting plasmid for this construct was pVAX1 (Invitrogen). EP2 and EP1 were
synthesized by GIBCO BRL. Wild type Ubiquitin cDNA encoding the 76 amino acids in the
construct was cloned from yeast.
Procedure:

1. RT-PCR was performed using yeast mRNA. Primers were designed to amplify the
complete coding sequence of yeast Ubiquitin.

2. The RT-PCR products were analyzed using agarose gel electrophoresis. A band with the
predicted size was gel-purified.

3. The purified DNA band was subcloned into pZERO1 at the EcoRV site. The resulting clone
was named pZERO-UB.

4. Several clones of pZERO-UB were sequenced to confirm the Ubiquitin sequence before
further manipulations.

5. EP1 and EP2 were synthesized.

6. EP2, Ubiquitin and EP1 were ligated and the insert cloned into pVAX1 between BamHI
and EcoRI, putting it under control of the CMV promoter.

7. The sequence of the insert EP2-UB-EP1 was confirmed by DNA sequencing.
Example 2 

Identification of useful epitope variants. 
The 10-mer FLPWHRLFLL (SEQ ID NO. 1) is identified as a useful epitope. Based on
this sequence, numerous variants are made. Variants exhibiting activity in HLA binding assays (see
Example 3, section 6) are identified as useful, and are subsequently incorporated into vaccines.

The HLA-A2 binding of length variants of FLPWHRLFLL has been evaluated.
Proteasomal digestion analysis indicates that the C-terminus of the 9-mer FLPWHRLFL (SEQ ID
NO. 8) is also produced. Additionally, the 9-mer LPWHRLFLL (SEQ ID NO. 9) can result from
N-terminal trimming of the 10-mer. Both are predicted to bind to the HLA-A*0201 molecule;
however, of these two 9-mers, FLPWHRLFL displayed more significant binding and is preferred
(see Figs. 3A and B).
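
The relationship between the 10-mer and its two 9-mer length variants can be illustrated with a short sketch. The function name `length_variants` is an illustrative assumption and is not part of the disclosure; the sketch only enumerates contiguous sub-peptides and says nothing about which of them a proteasome actually produces or how well they bind.

```python
def length_variants(peptide: str, length: int) -> list[str]:
    """Return every contiguous sub-peptide of the given length, N- to C-terminal."""
    if length <= 0 or length > len(peptide):
        return []
    return [peptide[i:i + length] for i in range(len(peptide) - length + 1)]


TEN_MER = "FLPWHRLFLL"  # SEQ ID NO. 1 as given in the text

# The two 9-mers contained in the 10-mer: the first shares its N-terminus
# with the 10-mer, the second corresponds to N-terminal trimming.
nine_mers = length_variants(TEN_MER, 9)
print(nine_mers)  # ['FLPWHRLFL', 'LPWHRLFLL']
```

The first entry corresponds to SEQ ID NO. 8 and the second to SEQ ID NO. 9.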

Sequence variants of FLPWHRLFL are constructed as follows. Consistent with the binding
coefficient table (see Table 3) from the NIH/BIMAS MHC binding prediction program (see
reference in Example 3 below), binding can be improved by changing the L at position 9, an anchor
position, to V. Binding can also be altered, though generally to a lesser extent, by changes at
non-anchor positions. Referring generally to Table 3, binding can be increased by employing residues
with relatively larger coefficients. Changes in sequence can also alter immunogenicity
independently of their effect on binding to MHC. Thus binding and/or immunogenicity can be
improved as follows:

By substituting F,L,M,W, or Y for P at position 3; these are all bulkier residues that can 
also improve immunogenicity independent of the effect on binding. The amine and hydroxyl- 
bearing residues, Q and N; and S and T; respectively, can also provoke a stronger, cross-reactive 
response. 

By substituting D or E for W at position 4 to improve binding; this addition of a negative
charge can also make the epitope more immunogenic, while in some cases reducing cross-reactivity
with the natural epitope. Alternatively, the conservative substitutions of F or Y can provoke a
cross-reactive response.

By substituting F for H at position 5 to improve binding. H can be viewed as partially
charged; thus in some cases the loss of charge can hinder cross-reactivity. Substitution of the fully
charged residues R or K at this position can enhance immunogenicity without disrupting
charge-dependent cross-reactivity.

By substituting I, L, M, V, F, W, or Y for R at position 6. The same caveats and
alternatives apply here as at position 5.

By substituting W or F for L at position 7 to improve binding. Substitutions of V, I, S, T, Q,
or N at this position are not generally predicted to reduce binding affinity by this model (the NIH
algorithm), yet can be advantageous as discussed above.

Y and W, which are equally preferred as the Fs at positions 1 and 8, can provoke a useful
cross-reactivity. Finally, while substitutions in the direction of bulkiness are generally favored to
improve immunogenicity, the substitution of smaller residues such as A, S, and C, at positions 3-7
can be useful according to the theory that contrast in size, rather than bulkiness per se, is an
important factor in immunogenicity. The reactivity of the thiol group in C can introduce other
properties as discussed in Chen, J.-L., et al. J. Immunol. 165:948-955, 2000.
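
The single-residue substitutions enumerated above can be collected programmatically. The following is a minimal sketch: the dictionary simply transcribes the bulkier, charged, and polar substitutions named in the text for positions 1 and 3-9 of FLPWHRLFL (the smaller residues A, S, and C suggested for positions 3-7 are left out for brevity), and the function name is an illustrative assumption. It generates candidate sequences only; it does not predict binding or immunogenicity.

```python
PARENT = "FLPWHRLFL"  # SEQ ID NO. 8 as given in the text

# 1-based position -> replacement residues named in the text for that position.
SUBSTITUTIONS = {
    1: "YW",
    3: "FLMWYQNST",
    4: "DEFY",
    5: "FRK",
    6: "ILMVFWY",
    7: "WFVISTQN",
    8: "YW",
    9: "V",
}


def single_substitution_variants(parent: str, substitutions: dict[int, str]) -> list[str]:
    """Return every variant of `parent` that differs at exactly one listed position."""
    variants = []
    for position in sorted(substitutions):
        for residue in substitutions[position]:
            if parent[position - 1] == residue:
                continue  # skip no-op "substitutions" that restore the parent residue
            variants.append(parent[:position - 1] + residue + parent[position:])
    return variants


variants = single_substitution_variants(PARENT, SUBSTITUTIONS)
print(len(variants))             # 36
print("FLPWHRLFV" in variants)   # True: the L-to-V change at anchor position 9
```

Each generated sequence would still need to be evaluated in binding and immunogenicity assays, as the text describes.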



WO 03/008537 






PCT/US02/10189 



Table 3. 9-mer Coefficient Table for HLA-A*0201*

HLA Coefficient table for file "A_0201_standard"

*This table and other comparable data that are publicly available are useful in designing epitope
variants and in determining whether a particular variant is substantially similar, or is functionally
similar.
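
As a rough illustration of how a position-specific coefficient table of this kind is used, the sketch below multiplies one coefficient per residue position to obtain a relative score, which is the general approach described by Parker et al. for the NIH algorithm. The coefficient values shown are HYPOTHETICAL placeholders chosen only to make the example run; they are not the values of Table 3, and the final scaling constant used by the published algorithm is omitted.

```python
from math import prod

# HYPOTHETICAL coefficients for illustration only; NOT taken from Table 3.
# Keys are (1-based position, residue); unlisted pairs default to 1.0.
EXAMPLE_COEFFICIENTS = {
    (2, "L"): 5.0,
    (9, "L"): 2.0,
    (9, "V"): 4.0,
}


def relative_score(peptide: str, coefficients: dict, default: float = 1.0) -> float:
    """Multiply the per-position coefficients of a 9-mer into a relative score."""
    if len(peptide) != 9:
        raise ValueError("this sketch scores 9-mers only")
    return prod(
        coefficients.get((position, residue), default)
        for position, residue in enumerate(peptide, start=1)
    )


# With these placeholder values, the L-to-V change at position 9 doubles the score.
print(relative_score("FLPWHRLFL", EXAMPLE_COEFFICIENTS))  # 10.0
print(relative_score("FLPWHRLFV", EXAMPLE_COEFFICIENTS))  # 20.0
```

Because the score is a product, a substitution changes it by the ratio of the new coefficient to the old one at that position, which is why residues with relatively larger coefficients are expected to increase predicted binding.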
Example 3 

Cluster Analysis (SSX-2 31-68).

1. Epitope cluster region prediction : 

The computer algorithms SYFPEITHI (internet http:// access at
syfpeithi.bmi-heidelberg.com/Scripts/MHCServer.dll/EpPredict.htm), based on the book "MHC
Ligands and Peptide Motifs" by H.G. Rammensee, J. Bachmann and S. Stevanovic; and HLA Peptide
Binding Predictions (NIH) (internet http:// access at bimas.dcrt.nih.gov/molbio/hla_bind),
described in Parker, K. C., et al., J. Immunol. 152:163, 1994; were used to analyze the protein
sequence of SSX-2 (GI:10337583). Epitope clusters (regions with higher than average density of
peptide fragments with high predicted MHC affinity) were defined as described fully in U.S. Patent
Application No. 09/561,571 entitled "EPITOPE CLUSTERS," filed on April 28, 2000. Using an
epitope density ratio cutoff of 2, and peptide score cutoffs of 16 (SYFPEITHI) and 5 (NIH), five
and two clusters were defined using the SYFPEITHI and NIH algorithms, respectively. The highest
scoring peptide with the NIH algorithm, SSX-2 41-49, with an estimated halftime of dissociation of
>1000 min., does not overlap any other predicted epitope but does cluster with SSX-2 57-65 in the
NIH analysis.
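
The density-ratio idea above can be illustrated with a minimal sketch. The exact definition is given in the cited "EPITOPE CLUSTERS" application and is not reproduced here; the sketch assumes the ratio is the density of predicted epitopes in a candidate region divided by the density over the whole protein, and that a region qualifies when the ratio meets the cutoff. The function name and the counts are hypothetical.

```python
def epitope_density_ratio(n_region, region_len, n_protein, protein_len):
    """Assumed definition: (predicted epitopes per residue in the region)
    divided by (predicted epitopes per residue in the whole protein)."""
    region_density = n_region / region_len
    protein_density = n_protein / protein_len
    return region_density / protein_density


# Hypothetical counts for illustration only (not taken from the SSX-2 analysis).
ratio = epitope_density_ratio(n_region=4, region_len=20,
                              n_protein=10, protein_len=200)
CUTOFF = 2  # density ratio cutoff used in this example
is_cluster = ratio >= CUTOFF  # whether the cutoff is inclusive is an assumption
print(ratio, is_cluster)
```

Only peptides scoring above the stated score cutoffs (16 for SYFPEITHI, 5 for NIH) would be counted as predicted epitopes before computing the ratio.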

2. Peptide synthesis and characterization : 

SSX-2 31-68, YFSKEEWEKMKAS (SEQ ID NO. 10) was synthesized by MPS (Multiple Peptide
Systems, San Diego, CA 92121) using standard solid phase chemistry. According to the provided
'Certificate of Analysis', the purity of this peptide was 95%.

3. Proteasome digestion : 

Proteasome was isolated from human red blood cells using the proteasome isolation
protocol described in U.S. Patent Application No. 09/561,074 entitled "METHOD OF EPITOPE
DISCOVERY," filed on April 28, 2000. SDS-PAGE, western-blotting, and ELISA were used as
quality control assays. The final concentration of proteasome was 4 mg/ml, which was determined
by non-interfering protein assay (Geno Technologies Inc.). Proteasomes were stored at -70°C in
25 uL aliquots.

SSX-2 31-68 was dissolved in Milli-Q water, a 2 mM stock solution was prepared, and 20 uL
aliquots were stored at -20°C.

One tube of proteasome (25 uL) was removed from storage at -70°C and thawed on ice. It was
then mixed thoroughly with 12.5 uL of 2 mM peptide by repipetting (samples were kept on ice). A
5 uL sample was removed immediately after mixing and transferred to a tube containing 1.25 uL
of 10% TFA (final concentration of TFA was 2%); this was the T=0 min sample. The proteasome
digestion reaction was then started and carried out at 37°C in a programmable thermal controller.
Additional 5 uL samples were taken out at 15, 30, 60, 120, 180 and 240 min; in each case the
reaction was stopped by adding the sample to 1.25 uL of 10% TFA as before. Samples were kept on
ice or frozen until being analyzed by MALDI-MS. All samples were saved and stored at -20°C for
HPLC analysis and N-terminal sequencing. Peptide alone (without proteasome) was used as a blank
control: 2 uL peptide + 4 uL Tris buffer (20 mM, pH 7.6) + 1.5 uL TFA.
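
The concentrations implied by the volumes above can be checked with simple dilution arithmetic. The following is a minimal sketch that assumes volumes are additive on mixing; the function name is hypothetical.

```python
def final_concentration(stock_conc, stock_vol, total_vol):
    """C_final = C_stock * V_stock / V_total (assumes additive volumes)."""
    return stock_conc * stock_vol / total_vol


# Quench step: 5 uL of sample added to 1.25 uL of 10% TFA.
tfa_percent = final_concentration(10.0, 1.25, 5.0 + 1.25)

# Digestion mix: 12.5 uL of 2 mM peptide added to 25 uL of 4 mg/ml proteasome.
peptide_mM = final_concentration(2.0, 12.5, 25.0 + 12.5)
proteasome_mg_ml = final_concentration(4.0, 25.0, 25.0 + 12.5)

print(tfa_percent, round(peptide_mM, 3), round(proteasome_mg_ml, 3))
```

Under that assumption the quenched sample contains 2% TFA, matching the stated value, and the digestion starts at roughly 0.67 mM peptide.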

4. MALDI-TOF MS measurements :

For each time point 0.3 uL of matrix solution (10 mg/ml α-cyano-4-hydroxycinnamic acid
in AcCN/H2O (70:30)) was first applied on a sample slide, and then an equal volume of digested
sample was mixed gently with matrix solution on the slide. The slide was allowed to dry at
ambient air for 3-5 min. before acquiring the mass spectra. MS was performed on a Lasermat 2000
MALDI-TOF mass spectrometer that was calibrated with peptide/protein standards. To improve
the accuracy of measurement, the molecular ion weight (MH+) of the peptide substrate was used as
an internal calibration standard. The mass spectrum of the T=120 min. digested sample is shown in
figure 4.
5. MS data analysis and epitope identification :

To assign the measured mass peaks, the computer program MS-Product, a tool from the
UCSF Mass Spectrometry Facility (http:// accessible at
prospector.ucsf.edu/ucsfhtml34/msprod.htm), was used to generate all possible fragments (N- and
C-terminal ions, and internal fragments) and their corresponding molecular weights. Due to the
sensitivity of the mass spectrometer, average molecular weight was used. The mass peaks
observed over the course of the digestion were identified as summarized in Table 4.
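
The peak-assignment step can be sketched as follows. This is not MS-Product; it is a minimal illustration that enumerates every contiguous fragment of a substrate, computes its average MH+ mass from standard average residue masses, and reports the fragments that fall within a tolerance of a measured peak. The tolerance value and function names are assumptions, and the substrate string is the 31-65 sequence as listed in Table 4.

```python
# Standard average residue masses (Da), rounded to four decimals.
AVG_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.0153   # average mass of H2O
PROTON = 1.0073   # mass of the added proton


def mh_plus(seq):
    """Average mass of the singly protonated peptide (MH+)."""
    return sum(AVG_RESIDUE_MASS[r] for r in seq) + WATER + PROTON


def candidate_fragments(substrate, first_residue, peak, tolerance):
    """Return (start, end, sequence, mass) for fragments matching the peak."""
    hits = []
    n = len(substrate)
    for i in range(n):
        for j in range(i + 1, n + 1):
            frag = substrate[i:j]
            mass = mh_plus(frag)
            if abs(mass - peak) <= tolerance:
                hits.append((first_residue + i, first_residue + j - 1,
                             frag, round(mass, 2)))
    return hits


SUBSTRATE = "YFSKEEWEKMKASEKIFYVYMKRKYEAMTKLGFKA"  # residues 31-65
hits = candidate_fragments(SUBSTRATE, 31, peak=988.23, tolerance=1.5)
for start, end, frag, mass in hits:
    print(f"{start}-{end}  {frag}  {mass}")
```

Because several fragments can share nearly the same mass, a mass match alone is only a candidate assignment; the text resolves such ambiguities by N-terminal sequencing.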

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by
the SYFPEITHI or NIH algorithms were chosen for further study. The digestion and prediction
steps of the procedure can be usefully practiced in any order. Although the substrate peptide used
in the proteasomal digest described here was specifically designed to include predicted HLA-A2.1
binding sequences, the actual products of digestion can be checked after the fact for actual or
predicted binding to other MHC molecules. Selected results are shown in Table 5.
Table 4. SSX-2 31-68 Mass Peak Identification.

MS PEAK (measured)   PEPTIDE    SEQUENCE                              CALCULATED MASS (MH+)
988.23               31-37      YFSKEEW                               989.08
1377.68±2.38         31-40      YFSKEEWEKM                            1377.68
1662.45±1.30         31-43      YFSKEEWEKMKAS                         1663.90
2181.72±0.85         31-47      YFSKEEWEKMKASEKIF                     2181.52
2346.6               31-48      YFSKEEWEKMKASEKIFY                    2344.71
1472.16±1.54         38-49      EKMKASEKIFYV                          1473.77
2445.78±1.18         31-49*     YFSKEEWEKMKASEKIFYV                   2443.84
2607.                31-50      YFSKEEWEKMKASEKIFYVY                  2607.02
1563.3               50-61      YMKRKYEAMTKL                          1562.93
3989.9               31-61      YFSKEEWEKMKASEKIFYVYMKRKYEAMTKL       3987.77
1603.74±1.53         51-63      MKRKYEAMTKLGF                         1603.98
1766.45±1.5          50-63      YMKRKYEAMTKLGF                        1767.16
1866.32±1.22         49-63      VYMKRKYEAMTKLGF                       1866.29
4192.6               31-63      YFSKEEWEKMKASEKIFYVYMKRKYEAMTKLGF     4192.00
4392.1               31-65**    YFSKEEWEKMKASEKIFYVYMKRKYEAMTKLGFKA   4391.25

Boldface sequences correspond to peptides predicted to bind to MHC.

* On the basis of mass alone this peak could also have been assigned to the peptide 32-50; however,
proteasomal removal of just the N-terminal amino acid is unlikely. N-terminal sequencing (below)
verifies the assignment to 31-49.

** On the basis of mass this fragment might also represent 33-68. N-terminal sequencing (below) is
consistent with the assignment to 31-65.
Table 5. Predicted HLA binding by proteasomally generated fragments

SEQ ID NO.   PEPTIDE          HLA       SYFPEITHI   NIH
11           FSKEEWEKM        B*3501    NP†         90
12           KMKASEKIF        B*08      17          <5
13 & (14)    (K)MKASEKIFY     A1        19 (19)     <5
15 & (16)    (M)KASEKIFYV     A*0201    22 (16)     1017
                              B*08      17          <5
                              B*5101    22 (13)     60
                              B*5102    NP          133
                              B*5103    NP          121
17 & (18)    (K)ASEKIFYVY     A1        34 (19)     14
19 & (20)    (K)RKYEAMTKL     A*0201    15          <5
                              A26       15          NP
                              B14       NP          45 (60)
                              B*2705    21          15
                              B*2709    16          NP
                              B*5101    15          <5
21           KYEAMTKLGF       A1        16          <5
                              A24       NP          300
22           YEAMTKLGF        B*4403    NP          80
23           EAMTKLGF         B*08      22          <5

†No prediction


As seen in Table 5, N-terminal addition of authentic sequence to epitopes can generate epitopes for
the same or different MHC restriction elements. Note in particular the pairing of
(K)RKYEAMTKL (SEQ ID NOS 19 and (20)) with HLA-B14, where the 10-mer has a longer
predicted halftime of dissociation than the co-C-terminal 9-mer. Also note the case of the 10-mer
KYEAMTKLGF (SEQ ID NO. 21), which can be used as a vaccine useful with several MHC types
by relying on N-terminal trimming to create the epitopes for HLA-B*4403 and -B*08.
6. HLA-A*0201 binding assay :

Binding of the candidate epitope KASEKIFYV, SSX-2 41-49, (SEQ ID NO. 15) to HLA-A2.1
was assayed using a modification of the method of Stauss et al., (Proc Natl Acad Sci USA
89(17):7871-5 (1992)). Specifically, T2 cells, which express empty or unstable MHC molecules
on their surface, were washed twice with Iscove's modified Dulbecco's medium (IMDM) and
cultured overnight in serum-free AIM-V medium (Life Technologies, Inc., Rockville, MD)
supplemented with human β2-microglobulin at 3 ug/ml (Sigma, St. Louis, MO) and added peptide,
at 800, 400, 200, 100, 50, 25, 12.5, and 6.25 ug/ml, in a 96-well flat-bottom plate at 3x10^5
cells/200 ul/well. Peptide was mixed with the cells by repipetting before distributing to the plate
(alternatively peptide can be added to individual wells), and the plate was rocked gently for 2
minutes. Incubation was in a 5% CO2 incubator at 37°C. The next day the unbound peptide was
removed by washing twice with serum-free RPMI medium, and a saturating amount of anti-class I
HLA monoclonal antibody, fluorescein isothiocyanate (FITC)-conjugated anti-HLA A2, A28 (One
Lambda, Canoga Park, CA), was added. After incubation for 30 minutes at 4°C, cells were washed
3 times with PBS supplemented with 0.5% BSA, 0.05% (w/v) sodium azide, pH 7.4-7.6 (staining
buffer). (Alternatively W6/32 (Sigma) can be used as the anti-class I HLA monoclonal antibody,
the cells washed with staining buffer and then incubated with fluorescein isothiocyanate (FITC)-
conjugated goat F(ab') antimouse-IgG (Sigma) for 30 min at 4°C and washed 3 times as before.)
The cells were resuspended in 0.5 ml staining buffer. The analysis of surface HLA-A2.1 molecules
stabilized by peptide binding was performed by flow cytometry using a FACScan (Becton
Dickinson, San Jose, CA). If flow cytometry is not to be performed immediately, the cells can be
fixed by adding a quarter volume of 2% paraformaldehyde and storing in the dark at 4°C.
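
The eight peptide concentrations listed above form a two-fold dilution series starting at 800 ug/ml. A minimal sketch that regenerates the series (the function name is hypothetical):

```python
def twofold_series(start, steps):
    """Concentrations obtained by repeatedly halving the starting value."""
    return [start / 2 ** k for k in range(steps)]


series_ug_per_ml = twofold_series(800, 8)
print(series_ug_per_ml)
```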

The results of the experiment are shown in Figure 5. SSX-2 41-49 (SEQ ID NO. 15) was
found to bind HLA-A2.1 to a similar extent as the known A2.1 binder FLPSDYFPSV (HBV 18-27;
SEQ ID NO: 24) used as a positive control. An HLA-B44 binding peptide, AEMGKYSFY (SEQ
ID NO: 25), was used as a negative control. The fluorescence obtained from the negative control
was similar to the signal obtained when no peptide was used in the assay. Positive and negative
control peptides were chosen from Table 18.3.1 in Current Protocols in Immunology p. 18.3.2,
John Wiley and Sons, New York, 1998.
7. Immunogenicity :

A. In vivo immunization of mice.

HHD1 transgenic A*0201 mice (Pascolo, S., et al. J. Exp. Med. 185:2043-2051, 1997)
were anesthetized and injected subcutaneously at the base of the tail, avoiding lateral tail veins,
using 100 ul containing 100 nmol of SSX-2 41-49 (SEQ ID NO. 15) and 20 ug of HTL epitope
peptide in PBS emulsified with 50 ul of IFA (incomplete Freund's adjuvant).

B. Preparation of stimulating cells (LPS blasts).

Using spleens from 2 naive mice for each group of immunized mice, un-immunized mice
were sacrificed and the carcasses were placed in alcohol. Using sterile instruments, the top dermal
layer of skin on the mouse's left side (lower mid-section) was cut through, exposing the
peritoneum. The peritoneum was saturated with alcohol, and the spleen was aseptically extracted.
The spleen was placed in a petri dish with serum-free media. Splenocytes were isolated by using
sterile plungers from 3 ml syringes to mash the spleens. Cells were collected in 50 ml conical
tubes in serum-free media, rinsing the dish well. Cells were centrifuged (12000 rpm, 7 min) and
washed one time with RPMI. Fresh spleen cells were resuspended to a concentration of 1x10^6
cells per ml in RPMI-10% FCS (fetal calf serum). 25 ug/ml lipopolysaccharide and 7 ug/ml Dextran
Sulfate were added. Cells were incubated for 3 days in T-75 flasks at 37°C, with 5% CO2. Splenic
blasts were collected in 50 ml tubes, pelleted (12000 rpm, 7 min) and resuspended to 3x10^7/ml in
RPMI. The blasts were pulsed with the priming peptide at 50 ug/ml, RT, 4 hr, mitomycin C-treated
at 25 ug/ml, 37°C, 20 min, and washed three times with DMEM.

C. In vitro stimulation.

3 days after LPS stimulation of the blast cells and the same day as peptide loading, the
primed mice were sacrificed (at 14 days post immunization) to remove spleens as above. 3x10^6
splenocytes were co-cultured with 1x10^6 LPS blasts/well in 24-well plates at 37°C, with 5% CO2,
in DMEM media supplemented with 10% FCS, 5x10^-5 M β-mercaptoethanol, 100 ug/ml
streptomycin and 100 IU/ml penicillin. Cultures were fed 5% (vol/vol) ConA supernatant on day 3
and assayed for cytolytic activity on day 7 in a 51Cr-release assay.

D. Chromium-release assay measuring CTL activity.

To assess peptide specific lysis, 2x10^6 T2 cells were incubated with 100 uCi sodium
chromate together with 50 ug/ml peptide at 37°C for 1 hour. During incubation they were
gently shaken every 15 minutes. After labeling and loading, cells were washed three times with 10
ml of DMEM-10% FCS, wiping each tube with a fresh Kimwipe after pouring off the supernatant.
Target cells were resuspended in DMEM-10% FBS at 1x10^5/ml. Effector cells were adjusted to
1x10^7/ml in DMEM-10% FCS and 100 ul serial 3-fold dilutions of effectors were prepared in U-
bottom 96-well plates. 100 ul of target cells were added per well. In order to determine
spontaneous release and maximum release, six additional wells containing 100 ul of target cells
were prepared for each target. Spontaneous release was revealed by incubating the target cells with
100 ul medium; maximum release was revealed by incubating the target cells with 100 ul of 2%
SDS. Plates were then centrifuged for 5 min at 600 rpm and incubated for 4 hours at 37°C in 5%
CO2 and 80% humidity. After the incubation, plates were centrifuged for 5 min at 1200 rpm.
Supernatants were harvested and counted using a gamma counter. Specific lysis was determined as
follows: % specific release = [(experimental release - spontaneous release)/(maximum release -
spontaneous release)] x 100.

Results of the chromium release assay demonstrating specific lysis of peptide pulsed target
cells are shown in figure 6.
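
The specific-release formula given above can be written as a short function. This is a minimal sketch; the counts in the usage line are hypothetical gamma-counter readings, not data from this example.

```python
def percent_specific_release(experimental, spontaneous, maximum):
    """% specific release =
    (experimental - spontaneous) / (maximum - spontaneous) x 100."""
    if maximum <= spontaneous:
        raise ValueError("maximum release must exceed spontaneous release")
    return (experimental - spontaneous) / (maximum - spontaneous) * 100.0


# Hypothetical counts per minute for one effector dilution.
lysis = percent_specific_release(experimental=1500, spontaneous=500, maximum=2500)
print(lysis)
```

The guard against a maximum release no larger than the spontaneous release is an added convenience, since the ratio is undefined or meaningless in that case.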

8. Cross-reactivity with other SSX proteins :

SSX-2 41-49 (SEQ ID NO. 15) shares a high degree of sequence identity with the same region
of the other SSX proteins. The surrounding regions have also been generally well conserved. Thus
the housekeeping proteasome can cleave following V49 in all five sequences. Moreover, SSX 41-49 is
predicted to bind HLA-A*0201 (see Table 6). CTL generated by immunization with SSX-2 41-49
cross-react with tumor cells expressing other SSX proteins.



Table 6. SSX 41-49 - A*0201 Predicted Binding

SEQ ID NO.   Family Member   Sequence     SYFPEITHI Score   NIH Score
15           SSX-2           KASEKIFYV    22                1017
26           SSX-1           KYSEKISYV    18                1.7
27           SSX-3           KVSEKIVYV    24                1105
28           SSX-4           KSSEKIVYV    20                82
29           SSX-5           KASEKIIYV    22                175
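
The degree of sequence identity among the family members can be counted position by position. The following minimal sketch uses the five 9-mer sequences of Table 6 as best read from the table; the function and variable names are hypothetical.

```python
SSX_41_49 = {
    "SSX-2": "KASEKIFYV",
    "SSX-1": "KYSEKISYV",
    "SSX-3": "KVSEKIVYV",
    "SSX-4": "KSSEKIVYV",
    "SSX-5": "KASEKIIYV",
}


def identical_positions(a, b):
    """Number of aligned positions at which two equal-length peptides agree."""
    if len(a) != len(b):
        raise ValueError("peptides must have the same length")
    return sum(x == y for x, y in zip(a, b))


reference = SSX_41_49["SSX-2"]
identity = {name: identical_positions(reference, seq)
            for name, seq in SSX_41_49.items()}
print(identity)
```

In every case the differences fall at positions 2 and 7, while the C-terminal residue V is conserved, which is consistent with the statement that the proteasome can cleave after V49 in all five sequences.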



Example 4

Cluster Analysis (PSMA 163-192).

A peptide, AFSPQGMPEGDLVYVNYARTEDFFKLERDM, PSMA 163-192, (SEQ ID NO.
30), containing an A1 epitope cluster from prostate specific membrane antigen, PSMA 168-190 (SEQ
ID NO. 31), was synthesized using standard solid-phase F-moc chemistry on a 433A ABI Peptide
synthesizer. After side chain deprotection and cleavage from the resin, the peptide, first dissolved in
formic acid and then diluted into 30% acetic acid, was run on a reverse-phase preparative HPLC
C4 column at the following conditions: linear AB gradient (5% B/min) at a flow rate of 4 ml/min,
where eluent A is 0.1% aqueous TFA and eluent B is 0.1% TFA in acetonitrile. A fraction at time
16.642 min containing the expected peptide, as judged by mass spectrometry, was pooled and
lyophilized. The peptide was then subjected to proteasome digestion and mass spectrum analysis
essentially as described above. Prominent peaks from the mass spectra are summarized in Table 7.
Table 7. PSMA 163-192 Mass Peak Identification.

PEPTIDE    SEQUENCE                  CALCULATED MASS (MH+)
163-177    AFSPQGMPEGDLVYV           1610.0
178-189    NYARTEDFFKLE              1533.68
170-189    PEGDLVYVNYARTEDFFKLE      2406.66
178-191    NYARTEDFFKLERD            1804.95
170-191    PEGDLVYVNYARTEDFFKLERD    2677.93
178-192    NYARTEDFFKLERDM           1936.17
163-176    AFSPQGMPEGDLVY            1511.70
177-192    VNYARTEDFFKLERDM          2035.30
163-179    AFSPQGMPEGDLVYVNY         1888.12
180-192    ARTEDFFKLERDM             1658.89
163-183    AFSPQGMPEGDLVYVNYARTE     2345.61
184-192    DFFKLERDM                 1201.40
176-192    YVNYARTEDFFKLERDM         2198.48
167-185    QGMPEGDLVYVNYARTEDF       2205.41
178-186    NYARTEDFF                 1163.22



Boldface sequences correspond to peptides predicted to bind to MHC, see Table 8. 



N-terminal Pool Sequence Analysis

One aliquot at one hour of the proteasomal digestion (see Example 3 part 3 above) was
subjected to N-terminal amino acid sequence analysis by an ABI 473A Protein Sequencer (Applied
Biosystems, Foster City, CA). Determination of the sites and efficiencies of cleavage was based on
consideration of the sequence cycle, the repetitive yield of the protein sequencer, and the relative
yields of amino acids unique in the analyzed sequence. That is, if the unique (in the analyzed
sequence) residue X appears only in the nth cycle, a cleavage site exists n-1 residues before it in the
N-terminal direction. In addition to helping resolve any ambiguity in the assignment of mass to
sequences, these data also provide a more reliable indication of the relative yield of the various
fragments than does mass spectrometry.

For PSMA 163-192 (SEQ ID NO. 30) this pool sequencing supports a single major cleavage
site after V177 and several minor cleavage sites, particularly one after Y179. Reviewing the results
presented in figures 7A-C reveals the following:

S at the 3rd cycle indicating presence of the N-terminus of the substrate.

Q at the 5th cycle indicating presence of the N-terminus of the substrate.

N at the 1st cycle indicating cleavage after V177.

N at the 3rd cycle indicating cleavage after V175. Note the fragment 176-192 in Table 7.

T at the 5th cycle indicating cleavage after V177.

T at the 1st-3rd cycles, indicating increasingly common cleavages after R181, A180 and Y179.
    Only the last of these corresponds to peaks detected by mass spectrometry; 163-179 and
    180-192, see Table 7. The absence of the others can indicate that they are on
    fragments smaller than were examined in the mass spectrum.

K at the 4th, 8th and 10th cycles indicating cleavages after E183, Y179, and V177, respectively,
    all of which correspond to fragments observed by mass spectroscopy. See Table 7.

A at the 1st and 3rd cycles indicating presence of the N-terminus of the substrate and
    cleavage after V177, respectively.

P at the 4th and 8th cycles indicating presence of the N-terminus of the substrate.

G at the 6th and 10th cycles indicating presence of the N-terminus of the substrate.

M at the 7th cycle indicating presence of the N-terminus of the substrate and/or cleavage
    after F185.

M at the 15th cycle indicating cleavage after V177.
    The 1st cycle can indicate cleavage after D191, see Table 7.

R at the 4th and 13th cycle indicating cleavage after V177.

R at the 2nd and 11th cycle indicating cleavage after Y179.

V at the 2nd, 6th, and 13th cycle indicating cleavage after V175, M169 and presence of the N-
    terminus of the substrate, respectively. Note fragments beginning at 176 and 170 in
    Table 7.

Y at the 1st, 2nd, and 14th cycles indicating cleavage after V175, V177, and presence of the N-
    terminus of the substrate, respectively.

L at the 11th and 12th cycles indicating cleavage after V177, and presence of the N-terminus
    of the substrate, respectively, is the interpretation most consistent with the other data.
    Comparing to the mass spectrometry results we see that L at the 2nd, 5th, and 9th cycles
    is consistent with cleavage after F186, E183 or M169, and Y179, respectively. See Table 7.
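
The cycle-to-cleavage-site reasoning used in the list above can be sketched in code. For a residue observed at sequencing cycle n, the fragment being sequenced must begin n-1 residues upstream of some occurrence of that residue, so the cleavage lies immediately before that start position. The sketch below enumerates every such explanation for the PSMA 163-192 substrate; the function name and output format are hypothetical, and it does not model repetitive yield or peak heights.

```python
SUBSTRATE = "AFSPQGMPEGDLVYVNYARTEDFFKLERDM"  # PSMA 163-192
FIRST = 163  # residue number of the first substrate residue


def explanations(residue, cycle, substrate=SUBSTRATE, first=FIRST):
    """List the ways a residue seen at a given cycle could arise."""
    results = []
    for idx, aa in enumerate(substrate):
        if aa != residue:
            continue
        start_idx = idx - (cycle - 1)  # index of the fragment's first residue
        if start_idx < 0:
            continue  # not enough upstream residues for this cycle
        label = f"{aa}{first + idx}"
        if start_idx == 0:
            results.append(f"{label}: N-terminus of the substrate")
        else:
            prev = substrate[start_idx - 1]
            results.append(f"{label}: cleavage after {prev}{first + start_idx - 1}")
    return results


print(explanations("N", 1))
print(explanations("S", 3))
print(explanations("K", 4))
print(explanations("L", 5))
```

For residues that occur more than once in the substrate, such as L, the function returns several candidate explanations, which is why the text weighs them against the mass spectrometry results.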
Epitope Identification

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by
the SYFPEITHI or NIH algorithms were chosen for further analysis. The digestion and prediction
steps of the procedure can be usefully practiced in any order. Although the substrate peptide used
in the proteasomal digest described here was specifically designed to include a predicted HLA-A1
binding sequence, the actual products of digestion can be checked after the fact for actual or
predicted binding to other MHC molecules. Selected results are shown in Table 8.
Table 8. Predicted HLA binding by proteasomally generated fragments

SEQ ID NO    PEPTIDE          HLA       SYFPEITHI   NIH
32 & (33)    (G)MPEGDLVYV     A*0201    17 (27)     (2605)
                              B*0702    20          <5
                              B*5101    22          314
34 & (35)    (Q)GMPEGDLVY     A1        24 (26)     <5
                              A3        16 (18)     36
                              B*2705    17          25
36           MPEGDLVY         B*5101    15          NP†
37 & (38)    (P)EGDLVYVNY     A1        27 (15)     12
                              A26       23 (17)     NP
39           LVYVNYARTE       A3        21          <5
40 & (41)    (Y)VNYARTEDF     A26       (20)        NP
                              B*08      15          <5
                              B*2705    12          50
42           NYARTEDFF        A24       NP†         100
                              Cw*0401   NP          120
43           YARTEDFF         B*08      16          <5
44           RTEDFFKLE        A1        21          <5
                              A26       15          NP

†No prediction



HLA-A*0201 binding assay:

HLA-A*0201 binding studies were performed with PSMA 168-177, GMPEGDLVYV, (SEQ
ID NO. 33) essentially as described in Example 3 above. As seen in figure 8, this epitope exhibits
significant binding at even lower concentrations than the positive control peptides. The Melan-A
peptide used as a control in this assay (and throughout this disclosure), ELAGIGILTV, is actually a
variant of the natural sequence (EAAGIGILTV) and exhibits a high affinity in this assay.
Example 5

Cluster Analysis (PSMA 281-310).

Another peptide, RGIAEAVGLPSIPVHPIGYYDAQKLLEKMG, PSMA 281-310, (SEQ ID
NO. 45), containing an A1 epitope cluster from prostate specific membrane antigen, PSMA 283-307
(SEQ ID NO. 46), was synthesized using standard solid-phase F-moc chemistry on a 433A ABI
Peptide synthesizer. After side chain deprotection and cleavage from the resin, peptide in ddH2O
was run on a reverse-phase preparative HPLC C18 column at the following conditions: linear AB
gradient (5% B/min) at a flow rate of 4 ml/min, where eluent A is 0.1% aqueous TFA and eluent B
is 0.1% TFA in acetonitrile. A fraction at time 17.061 min containing the expected peptide, as
judged by mass spectrometry, was pooled and lyophilized. The peptide was then subjected to
proteasome digestion and mass spectrum analysis essentially as described above. Prominent peaks
from the mass spectra are summarized in Table 9.



Table 9. PSMA 281-310 Mass Peak Identification.

PEPTIDE    SEQUENCE                    CALCULATED MASS (MH+)
281-297    RGIAEAVGLPSIPVHPI*          1727.07
286-297    AVGLPSIPVHPI**              1200.46
287-297    VGLPSIPVHPI                 1129.38
288-297    GLPSIPVHPI†                 1030.25
298-310    GYYDAQKLLEKMG‡              1516.5
298-305    GYYDAQKL§                   958.05
281-305    RGIAEAVGLPSIPVHPIGYYDAQKL   2666.12
281-307    RGIAEAVGLPSIPVHPIGYYDAQKLLE 2908.39
286-307    AVGLPSIPVHPIGYYDAQKLLE¶     2381.78
287-307    VGLPSIPVHPIGYYDAQKLLE       2310.70
288-307    GLPSIPVHPIGYYDAQKLLE#       2211.57
281-299    RGIAEAVGLPSIPVHPIGY         1947
286-299    AVGLPSIPVHPIGY              1420.69
287-299    VGLPSIPVHPIGY               1349.61
288-299    GLPSIPVHPIGY                1250.48
287-310    VGLPSIPVHPIGYYDAQKLLEKMG    2627.14
288-310    GLPSIPVHPIGYYDAQKLLEKMG     2528.01



Boldface sequences correspond to peptides predicted to bind to MHC, see Table 10. 



*By mass alone this peak could also have been 296-310 or 288-303.
**By mass alone this peak could also have been 298-307. Combination of HPLC and mass
spectrometry shows that at some later time points this peak is a mixture of both species.
†By mass alone this peak could also have been 289-298.
‡By mass alone this peak could also have been 281-295 or 294-306.
§By mass alone this peak could also have been 297-303.
¶By mass alone this peak could also have been 285-306.
#By mass alone this peak could also have been 288-303.
None of these alternate assignments is supported by N-terminal pool sequence analysis.

N-terminal Pool Sequence Analysis 

One aliquot at one hour of the proteasomal digestion (see Example 3 part 3 above) was
subjected to N-terminal amino acid sequence analysis by an ABI 473A Protein Sequencer (Applied
Biosystems, Foster City, CA). Determination of the sites and efficiencies of cleavage was based on
consideration of the sequence cycle, the repetitive yield of the protein sequencer, and the relative
yields of amino acids unique in the analyzed sequence. That is, if the unique (in the analyzed
sequence) residue X appears only in the nth cycle, a cleavage site exists n-1 residues before it in the
N-terminal direction. In addition to helping resolve any ambiguity in the assignment of mass to
sequences, these data also provide a more reliable indication of the relative yield of the various
fragments than does mass spectrometry.

For PSMA 281-310 (SEQ ID NO. 45) this pool sequencing supports two major cleavage sites
after V287 and I297 among other minor cleavage sites. Reviewing the results presented in Fig. 9
reveals the following:

S at the 4th and 11th cycles indicating cleavage after V287 and presence of the N-terminus of
    the substrate, respectively.

H at the 8th cycle indicating cleavage after V287. The lack of decay in peak height at
    positions 9 and 10 versus the drop in height present going from 10 to 11 can suggest
    cleavage after A286 and E285 as well, rather than the peaks representing latency in the
    sequencing reaction.

D at the 2nd, 4th, and 7th cycles indicating cleavages after Y299, I297, and V294, respectively.
    This last cleavage is not observed in any of the fragments in Table 10 or in the
    alternate assignments in the notes below.

Q at the 6th cycle indicating cleavage after I297.

M at the 10th and 12th cycle indicating cleavages after Y299 and I297, respectively.

Epitope Identification 

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by
the SYFPEITHI or NIH algorithms were chosen for further study. The digestion and prediction
steps of the procedure can be usefully practiced in any order. Although the substrate peptide used
in the proteasomal digest described here was specifically designed to include a predicted HLA-A1
binding sequence, the actual products of digestion can be checked after the fact for actual or
predicted binding to other MHC molecules. Selected results are shown in Table 10.






Table 10. Predicted HLA binding by proteasomally generated fragments: PSMA 281-310

SEQ ID NO.   PEPTIDE          HLA          SYFPEITHI   NIH
47 & (48)    (G)LPSIPVHPI     A*0201       16 (24)     (24)
                              B*0702/B7    23          12
                              B*5101       24          572
                              Cw*0401      NP†         20
49 & (50)    (P)IGYYDAQKL     A*0201       (16)        <5
                              A26          (20)        NP
                              B*2705       16          25
                              B*2709       15          NP
                              B*5101       21          57
                              Cw*0301      NP          24
51 & (52)    (P)SIPVHPIGY     A1           21 (27)     <5
                              A26          22          NP
                              A3           16          <5
53           IPVHPIGY         B*5101       16          NP
54           YYDAQKLLE        A1           22          <5

†No prediction



As seen in Table 10, N-terminal addition of authentic sequence to epitopes can often generate still
useful, even better epitopes, for the same or different MHC restriction elements. Note for example
the pairing of (G)LPSIPVHPI with HLA-A*0201, where the 10-mer can be used as a vaccine
useful with several MHC types by relying on N-terminal trimming to create the epitopes for
HLA-B7, -B*5101, and Cw*0401.

HLA-A*0201 binding assay: 

HLA-A*0201 binding studies were performed with PSMA 288-297, GLPSIPVHPI, (SEQ ID
NO. 48) essentially as described in Examples 3 and 4 above. As seen in figure 8, this epitope
exhibits significant binding at even lower concentrations than the positive control peptides.
Example 6

Cluster Analysis (PSMA 454-481).

Another peptide, SSIEGNYTLRVDCTPLMYSLVHNLTKEL, PSMA 454-481, (SEQ ID NO.
55) containing an epitope cluster from prostate specific membrane antigen, was synthesized by
MPS (purity >95%) and subjected to proteasome digestion and mass spectrum analysis as
described above. Prominent peaks from the mass spectra are summarized in Table 11.



Table 11. PSMA 454-481 Mass Peak Identification.

MS PEAK (measured)   PEPTIDE     SEQUENCE             CALCULATED MASS (MH+)
1238.5               454-464     SSIEGNYTLRV          1239.78
1768.38±0.60         454-469     SSIEGNYTLRVDCTPL     1768.99
1899.8               454-470     SSIEGNYTLRVDCTPLM    1900.19
1097.63±0.91         463-471     RVDCTPLMY            1098.32
2062.87±0.68         454-471*    SSIEGNYTLRVDCTPLMY   2063.36
1153                 472-481**   SLVHNLTKEL           1154.36
1449.93±1.79         470-481     MYSLVHNLTKEL         1448.73



Boldface sequences correspond to peptides predicted to bind to MHC, see Table 12.

* On the basis of mass alone this peak could equally well be assigned to the peptide 455-472;
however, proteasomal removal of just the N-terminal amino acid is considered unlikely. If the issue
were important it could be resolved by N-terminal sequencing.
** On the basis of mass this fragment might also represent 455-464.


Epitope Identification 

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by
the SYFPEITHI or NIH algorithms were chosen for further study. The digestion and prediction
steps of the procedure can be usefully practiced in any order. Although the substrate peptide used
in the proteasomal digest described here was specifically designed to include predicted HLA-A2.1
binding sequences, the actual products of digestion can be checked after the fact for actual or
predicted binding to other MHC molecules. Selected results are shown in Table 12.
Table 12. Predicted HLA binding by proteasomally generated fragments

SEQ ID NO    PEPTIDE          HLA       SYFPEITHI   NIH
56 & (57)    (S)IEGNYTLRV     A1        (19)        <5
                              A*0201    16 (22)     <5
58           EGNYTLRV         B*5101    15          NP†
59 & (60)    (Y)TLRVDCTPL     A*0201    20 (18)     (5)
                              A26       16 (18)     NP
                              B7        14          40
                              B8        23          <5
                              B*2705    12          30
                              Cw*0301   NP          (30)
61           LRVDCTPLM        B*2705    20          600
                              B*2709    20          NP
62 & (63)    (L)RVDCTPLMY     A1        32 (22)     125 (13.5)
                              A3        25          <5
                              A26       22          NP
                              B*2702    NP          (200)
                              B*2705    13 (NP)     (1000)

†No prediction

As seen in Table 12, N-terminal addition of authentic sequence to epitopes can often generate still
useful, even better epitopes, for the same or different MHC restriction elements. Note for example
the pairing of (L)RVDCTPLMY (SEQ ID NOS 62 and (63)) with HLA-B*2702/5, where the 10-
mer has substantial predicted halftimes of dissociation and the co-C-terminal 9-mer does not. Also
note the case of SIEGNYTLRV (SEQ ID NO 57), a predicted HLA-A*0201 epitope which can be
used as a vaccine useful with HLA-B*5101 by relying on N-terminal trimming to create the
epitope.

HLA-A*0201 binding assay

HLA-A*0201 binding studies were performed, essentially as described in Example 3
above, with PSMA 461-469, TLRVDCTPL, (SEQ ID NO. 60). As seen in figure 10, this epitope was
found to bind HLA-A2.1 to a similar extent as the known A2.1 binder FLPSDYFPSV (HBV 18-27;
SEQ ID NO: 24) used as a positive control. Additionally, PSMA 460-469, (SEQ ID NO. 59) binds
nearly as well.

ELISPOT analysis: PSMA463-471 (SEQ ID NO. 62)
The wells of a nitrocellulose-backed microtiter plate were coated with capture antibody by
incubating overnight at 4°C using 50 µl/well of 4 µg/ml murine anti-human γ-IFN monoclonal
antibody in coating buffer (35 mM sodium bicarbonate, 15 mM sodium carbonate, pH 9.5).
Unbound antibody was removed by washing 4 times for 5 min. with PBS. Unbound sites on the
membrane then were blocked by adding 200 µl/well of RPMI medium with 10% serum and
incubating 1 hr. at room temperature. Antigen stimulated CD8+ T cells, in 1:3 serial dilutions,
were seeded into the wells of the microtiter plate using 100 µl/well, starting at 2x10^5 cells/well.
(Prior antigen stimulation was essentially as described in Scheibenbogen, C. et al. Int. J. Cancer
71:932-936, 1997.) PSMA463-471 (SEQ ID NO. 62) was added to a final concentration of 10 µg/ml
and IL-2 to 100 U/ml and the cells cultured at 37°C in a 5% CO2, water-saturated atmosphere for
40 hrs. Following this incubation the plates were washed 6 times with 200 µl/well of PBS
containing 0.05% Tween-20 (PBS-Tween). Detection antibody, 50 µl/well of 2 µg/ml biotinylated
murine anti-human γ-IFN monoclonal antibody in PBS+10% fetal calf serum, was added and the
plate incubated at room temperature for 2 hrs. Unbound detection antibody was removed by
washing 4 times with 200 µl of PBS-Tween. 100 µl of avidin-conjugated horseradish peroxidase
(Pharmingen, San Diego, CA) was added to each well and incubated at room temperature for 1 hr.
Unbound enzyme was removed by washing 6 times with 200 µl of PBS-Tween. Substrate was
prepared by dissolving a 20 mg tablet of 3-amino-9-ethylcarbazole in 2.5 ml of N,N-dimethylformamide
and adding that solution to 47.5 ml of 0.05 M phosphate-citrate buffer (pH
5.0). 25 µl of 30% H2O2 was added to the substrate solution immediately before distributing
substrate at 100 µl/well and incubating the plate at room temperature. After color development
(generally 15-30 min.), the reaction was stopped by washing the plate with water. The plate was
air dried and the spots counted using a stereomicroscope.

Figure 11 shows the detection of PSMA463-471 (SEQ ID NO. 62)-reactive HLA-A1+ CD8+ T
cells previously generated in cultures of HLA-A1+ CD8+ T cells with autologous dendritic cells
plus the peptide. No reactivity is detected from cultures without peptide (data not shown). In this
case it can be seen that the peptide reactive T cells are present in the culture at a frequency between
1 in 2.2x10^4 and 1 in 6.7x10^4. That this is truly an HLA-A1-restricted response is demonstrated by
the ability of anti-HLA-A1 monoclonal antibody to block γ-IFN production; see figure 12.
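
The arithmetic of the 1:3 serial dilution used in the ELISPOT assay can be sketched as follows. The function name and the choice of four dilution steps are assumptions made for illustration; only the starting density (2x10^5 cells/well) and the dilution factor (3) come from the text. The second and third wells hold roughly 6.7x10^4 and 2.2x10^4 cells, which coincides with the two bounds quoted for the frequency of reactive cells; reading the bounds as those two wells is an interpretation, not something the text states.

```python
def serial_dilution(start_cells, factor, steps):
    """Cells per well for each step of a serial dilution
    (hypothetical helper; the first well is undiluted)."""
    return [start_cells / factor ** i for i in range(steps)]


wells = serial_dilution(start_cells=2e5, factor=3, steps=4)
for index, cells in enumerate(wells, start=1):
    print(f"well {index}: {cells:,.0f} cells")
```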
Example 7 

Cluster Analysis (PSMA653-687).
Another peptide, FDKSNPIVLRMMNDQLMFLERAFIDPLGLPDRPFY, PSMA653-687
(SEQ ID NO. 64), containing an A2 epitope cluster from prostate specific membrane antigen
(SEQ ID NO. 65), was synthesized by MPS (purity >95%) and subjected to proteasome
digestion and mass spectrum analysis as described above. Prominent peaks from the mass spectra
are summarized in Table 13.
Table 13. Mass peaks from proteasomal digestion of PSMA653-687

MS PEAK          PEPTIDE      SEQUENCE                CALCULATED
(measured)                                            MASS (MH+)
906.17±0.65      681-687**    LPDRPFY                 908.05
1287.73±0.76     677-687**    DPLGLPDRPFY             1290.47
1400.3±1.79      676-687      IDPLGLPDRPFY            1403.63
1548.0±1.37      675-687      FIDPLGLPDRPFY           1550.80
1619.5±1.51      674-687**    AFIDPLGLPDRPFY          1621.88
1775.48±1.32     673-687*     RAFIDPLGLPDRPFY         1778.07
2440.2±1.3       653-672      FDKSNPIVLRMMNDQLMFLE    2442.93
1904.63±1.56     672-687*     ERAFIDPLGLPDRPFY        1907.19
2310.6±2.5       653-671      FDKSNPIVLRMMNDQLMFL     2313.82
2017.4±1.94      671-687      LERAFIDPLGLPDRPFY       2020.35
2197.43±1.78     653-670      FDKSNPIVLRMMNDQLMF      2200.66

Boldface sequences correspond to peptides predicted to bind to MHC, see Table 14.
* On the basis of mass alone this peak could equally well be assigned to a peptide beginning at
654; however, proteasomal removal of just the N-terminal amino acid is considered unlikely. If the
issue were important it could be resolved by N-terminal sequencing.

** On the basis of mass alone these peaks could have been assigned to internal fragments, but
given the overall pattern of digestion it was considered unlikely.
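
The calculated masses in Table 13 appear to be average (not monoisotopic) masses of the singly protonated peptides. A minimal sketch of that calculation follows, assuming standard average residue masses; the mass table, the constants and the function name are supplied here for illustration and are not taken from the specification.

```python
# Standard average residue masses in daltons (assumed values).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528     # one water per peptide for the free termini
HYDROGEN = 1.00794   # added for the singly protonated ion


def average_mh_plus(peptide):
    """Average mass of the singly protonated peptide (hypothetical helper)."""
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in peptide) + WATER + HYDROGEN


for sequence in ("LPDRPFY", "DPLGLPDRPFY", "FDKSNPIVLRMMNDQLMF"):
    print(sequence, round(average_mh_plus(sequence), 2))
```

Comparing such a calculated value with a measured peak, within the stated measurement error, is how a peak is assigned to a candidate fragment; as the footnotes point out, mass alone can leave more than one assignment open.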



Epitope Identification

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by
the SYFPEITHI or NIH algorithms were chosen for further study. The digestion and prediction
steps of the procedure can be usefully practiced in any order. Although the substrate peptide used
in proteasomal digest described here was specifically designed to include predicted HLA-A2.1
binding sequences, the actual products of digestion can be checked after the fact for actual or
predicted binding to other MHC molecules. Selected results are shown in Table 14.
Table 14. Predicted HLA binding by proteasomally generated fragments

SEQ ID NO    PEPTIDE          HLA        SYFPEITHI    NIH
66 & (67)    (R)MMNDQLMFL     A*0201     24 (23)      1360 (722)
                              A*0205     NP†          71 (42)
                              A26        15           NP
                              B*2705     12           50
68           RMMNDQLMF        B*2705     17           75

†No prediction


As seen in Table 14, N-terminal addition of authentic sequence to epitopes can generate 
still useful, even better epitopes, for the same or different MHC restriction elements. Note for 
example the pairing of (R)MMNDQLMFL (SEQ ID NOS. 66 and (67)) with HLA-A*02, where the 
10-mer retains substantial predicted binding potential. 

HLA-A*0201 binding assay

HLA-A*0201 binding studies were performed, essentially as described in Example 3
above, with PSMA663-671 (SEQ ID NO. 66) and PSMA662-671, RMMNDQLMFL (SEQ ID NO. 67). As
seen in figures 10, 13 and 14, this epitope exhibits significant binding at even lower concentrations
than the positive control peptide (FLPSDYFPSV (HBV18-27); SEQ ID NO: 24). Though not run in
parallel, comparison to the controls suggests that PSMA662-671 (which approaches the Melan A
peptide in affinity) has the superior binding activity of these two PSMA peptides.
Example 8 

Vaccinating with epitope vaccines. 
1. Vaccination with peptide vaccines:

A. Intranodal delivery

A formulation containing peptide in aqueous buffer with an antimicrobial agent, an
antioxidant, and an immunomodulating cytokine, was injected continuously over several days into
the inguinal lymph node using a miniature pumping system developed for insulin delivery
(MiniMed; Northridge, CA). This infusion cycle was selected in order to mimic the kinetics of
antigen presentation during a natural infection.

B. Controlled release

A peptide formulation is delivered using controlled release PLGA microspheres as is known in the
art, which alter the pharmacokinetics of the peptide and improve immunogenicity. This
formulation is injected or taken orally.
C. Gene gun delivery

A peptide formulation is prepared wherein the peptide is adhered to gold microparticles as
is known in the art. The particles are delivered in a gene gun, being accelerated at high speed so as
to penetrate the skin, carrying the particles into dermal tissues that contain pAPCs.

D. Aerosol delivery

A peptide formulation is inhaled as an aerosol as is known in the art, for uptake into
appropriate vascular or lymphatic tissue in the lungs.
2. Vaccination with nucleic acid vaccines:

A nucleic acid vaccine is injected into a lymph node using a miniature pumping system,
such as the MiniMed insulin pump. A nucleic acid construct formulated in an aqueous buffered
solution containing an antimicrobial agent, an antioxidant, and an immunomodulating cytokine, is
delivered over a several day infusion cycle in order to mimic the kinetics of antigen presentation
during a natural infection.

Optionally, the nucleic acid construct is delivered using controlled release substances, such
as PLGA microspheres or other biodegradable substances. These substances are injected or taken
orally. Nucleic acid vaccines are given using oral delivery, priming the immune response through
uptake into GALT tissues. Alternatively, the nucleic acid vaccines are delivered using a gene gun,
wherein the nucleic acid vaccine is adhered to minute gold particles. Nucleic acid constructs can
also be inhaled as an aerosol, for uptake into appropriate vascular or lymphatic tissue in the lungs.

Example 9

Assays for the effectiveness of epitope vaccines. 

1. Tetramer analysis : 

Class I tetramer analysis is used to determine T cell frequency in an animal before and
after administration of a housekeeping epitope. Clonal expansion of T cells in response to an
epitope indicates that the epitope is presented to T cells by pAPCs. The specific T cell frequency
is measured against the housekeeping epitope before and after administration of the epitope to an
animal, to determine if the epitope is present on pAPCs. An increase in frequency of T cells
specific to the epitope after administration indicates that the epitope was presented on pAPC.

2. Proliferation assay : 

Approximately 24 hours after vaccination of an animal with housekeeping epitope, pAPCs
are harvested from PBMCs, splenocytes, or lymph node cells, using monoclonal antibodies against
specific markers present on pAPCs, fixed to magnetic beads for affinity purification. Crude blood
or splenocyte preparation is enriched for pAPCs using this technique. The enriched pAPCs are
then used in a proliferation assay against a T cell clone that has been generated and is specific for
the housekeeping epitope of interest. The pAPCs are coincubated with the T cell clone and the T
cells are monitored for proliferation activity by measuring the incorporation of radiolabeled
thymidine by T cells. Proliferation indicates that T cells specific for the housekeeping epitope are
being stimulated by that epitope on the pAPCs.
3. Chromium release assay : 

A human patient, or non-human animal genetically engineered to express human class I
MHC, is immunized using a housekeeping epitope. T cells from the immunized subject are used in
a standard chromium release assay using human tumor targets or targets engineered to express the
same class I MHC. T cell killing of the targets indicates that stimulation of T cells in a patient
would be effective at killing a tumor expressing a similar TuAA.
Example 10 

Induction of CTL response with naked DNA is efficient by intra-lymph node immunization.

In order to quantitatively compare the CD8+ CTL responses induced by different routes of
immunization, a plasmid DNA vaccine (pEGFPL33A) containing a well-characterized
immunodominant CTL epitope from the LCMV-glycoprotein (G) (gp33; amino acids 33-41)
(Oehen, S., et al., Immunology 99, 163-169, 2000) was used, as this system allows a comprehensive
assessment of antiviral CTL responses. Groups of 2 C57BL/6 mice were immunized once with
titrated doses (200-0.02 µg) of pEGFPL33A DNA or of control plasmid pEGFP-N3, administered
i.m. (intramuscular), i.d. (intradermal), i.spl. (intrasplenic), or i.ln. (intra-lymph node). Positive
control mice received 500 pfu LCMV i.v. (intravenous). Ten days after immunization spleen cells
were isolated and gp33-specific CTL activity was determined after secondary in vitro
restimulation. As shown in Fig. 15, i.m. or i.d. immunization induced weakly detectable CTL
responses when high doses of pEGFPL33A DNA (200 µg) were administered. In contrast, potent
gp33-specific CTL responses were elicited by immunization with only 2 µg pEGFPL33A DNA
i.spl. and with as little as 0.2 µg pEGFPL33A DNA given i.ln. (figure 15; symbols represent
individual mice and one of three similar experiments is shown). Immunization with the control
pEGFP-N3 DNA did not elicit any detectable gp33-specific CTL responses (data not shown).

Example 11 

Intra-lymph node DNA immunization elicits anti-tumor immunity. 

To examine whether the potent CTL responses elicited following i.ln. immunization were
able to confer protection against peripheral tumors, groups of 6 C57BL/6 mice were immunized
three times at 6-day intervals with 10 µg of pEGFPL33A DNA or control pEGFP-N3 DNA. Five
days after the last immunization small pieces of solid tumors expressing the gp33 epitope (EL4-33)
were transplanted s.c. into both flanks and tumor growth was measured every 3-4 d. Although the
EL4-33 tumors grew well in mice that had been repetitively immunized with control pEGFP-N3
DNA (figure 16), mice which were immunized with pEGFPL33A DNA i.ln. rapidly eradicated the
peripheral EL4-33 tumors (figure 16).
Example 12

Differences in lymph node DNA content mirror differences in CTL response following
intra-lymph node and intramuscular injection.

pEGFPL33A DNA was injected i.ln. or i.m. and plasmid content of the injected or draining
lymph node was assessed by real time PCR after 6, 12, 24, 48 hours, and 4 and 30 days. At 6, 12,
and 24 hours the plasmid DNA content of the injected lymph nodes was approximately three orders
of magnitude greater than that of the draining lymph nodes following i.m. injection. No plasmid
DNA was detectable in the draining lymph node at subsequent time points (Fig. 17). This is
consonant with the three orders of magnitude greater dose needed using i.m. as compared to i.ln.
injections to achieve similar levels of CTL activity. CD8-/- knockout mice, which do not develop
a CTL response to this epitope, were also injected i.ln., showing that clearance of DNA from the lymph
node is not due to CD8+ CTL killing of cells in the lymph node. This observation also supports the
conclusion that i.ln. administration will not provoke immunopathological damage to the lymph
node.

Example 13

Administration of a DNA plasmid formulation of a therapeutic vaccine for melanoma to 
humans. 

SYNCHROTOPE TA2M, a melanoma vaccine, encoding the HLA-A2-restricted tyrosinase
epitope SEQ ID NO. 1 and epitope cluster SEQ ID NO. 69, was formulated in 1% benzyl alcohol,
1% ethyl alcohol, 0.5 mM EDTA, citrate-phosphate, pH 7.6. Aliquots of 80, 160, and 320 µg
DNA/ml were prepared for loading into MINIMED 407C infusion pumps. The catheter of a
SILHOUETTE infusion set was placed into an inguinal lymph node visualized by ultrasound
imaging. The assembly of pump and infusion set was originally designed for the delivery of insulin
to diabetics and the usual 17 mm catheter was substituted with a 31 mm catheter for this application.
The infusion set was kept patent for 4 days (approximately 96 hours) with an infusion rate of about
25 µl/hour resulting in a total infused volume of approximately 2.4 ml. Thus the total administered
dose per infusion was approximately 200 and 400 µg, and can be 800 µg, respectively, for the
three concentrations described above. Following an infusion subjects were given a 10 day rest
period before starting a subsequent infusion. Given the continued residency of plasmid DNA in the
lymph node after administration (as in example 12) and the usual kinetics of CTL response
following disappearance of antigen, this schedule will be sufficient to maintain the immunologic
CTL response.
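
The infusion figures in this example follow from simple arithmetic, sketched below. The variable names are illustrative only; the rate, duration and concentrations are those stated in the text. The computed doses (192, 384 and 768 µg) are the values that the text rounds to approximately 200, 400 and 800 µg.

```python
RATE_UL_PER_HOUR = 25                         # infusion rate stated in the text
DURATION_HOURS = 96                           # 4 days
CONCENTRATIONS_UG_PER_ML = (80, 160, 320)     # the three DNA aliquots

# Total infused volume, converted from microliters to milliliters.
volume_ml = RATE_UL_PER_HOUR * DURATION_HOURS / 1000

# Dose per infusion cycle for each concentration.
doses_ug = [concentration * volume_ml for concentration in CONCENTRATIONS_UG_PER_ML]

print(f"infused volume: {volume_ml} ml")
for concentration, dose in zip(CONCENTRATIONS_UG_PER_ML, doses_ug):
    print(f"{concentration} ug/ml -> {dose:.0f} ug per infusion")
```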
Example 14
Additional Epitopes.

The methodologies described above, and in particular in examples 3-7, have been applied
to additional synthetic peptide substrates, leading to the identification of further epitopes as set
forth in tables 15-36 below. The substrates used here were designed to identify products of
housekeeping proteasomal processing that give rise to HLA-A*0201 binding epitopes, but
additional MHC-binding reactivities can be predicted, as discussed above. Many such reactivities
are disclosed; however, these listings are meant to be exemplary, not exhaustive or limiting. As
also discussed above, individual components of the analyses can be used in varying combinations
and orders. The digests of the NY-ESO-1 substrates 136-163 and 150-177 (SEQ ID NOS. 254 and
255, respectively) yielded fragments that did not fly well in MALDI-TOF mass spectrometry.
However, they were quite amenable to N-terminal peptide pool sequencing, thereby allowing
identification of cleavage sites. Not all of the substrates necessarily meet the formal definition of
an epitope cluster as referenced in example 3. Some clusters are so large, e.g. in NY-ESO-1, that
it was more convenient to use substrates spanning only a portion of this cluster. In other cases,
substrates were extended beyond clusters meeting the formal definition to include neighboring
predicted epitopes. In some instances, actual binding activity may have dictated what substrate was
made, as with for example the MAGE epitopes reported here, where HLA binding activity was
determined for a selection of peptides with predicted affinity, before synthetic substrates were
designed.
Example 15

Evaluating Likelihood of Epitope Cross-reactivity on Non-target Tissues.

As noted above PSA is a member of the kallikrein family of proteases, which is itself a
subset of the serine protease family. While the members of this family sharing the greatest degree
of sequence identity with PSA also share similar expression profiles, it remains possible that
individual epitope sequences might be shared with proteins having distinctly different expression
profiles. A first step in evaluating the likelihood of undesirable cross-reactivity is the identification
of shared sequences. One way to accomplish this is to conduct a BLAST search of an epitope
sequence against the SWISSPROT or Entrez non-redundant peptide sequence databases using the
"Search for short nearly exact matches" option; hypertext transfer protocol accessible on the world
wide web (http://www) at "ncbi.nlm.nih.gov/blast/index.html". Thus searching SEQ ID NO. 214,
WVLTAAHCI, against SWISSPROT (limited to entries for Homo sapiens) one finds four exact
matches, including PSA. The other three are from kallikrein 1 (tissue kallikrein), and elastase 2A
and 2B. While these nine amino acid segments are identical, the flanking sequences are quite
distinct, particularly on the C-terminal side, suggesting that processing may proceed differently and
that thus the same epitope may not be liberated from these other proteins. (Please note that
kallikrein naming is confused. Thus the kallikrein 1 [accession number P06870] is a different
protein than the one [accession number AAD13817] mentioned in the paragraph on PSA above in
the section on tumor-associated antigens.)

It is possible to test this possibility in several ways. Synthetic peptides containing the
epitope sequence embedded in the context of each of these proteins can be subjected to in vitro
proteasomal digestion and analysis as described above. Alternatively, cells expressing these other
proteins, whether by natural or recombinant expression, can be used as targets in a cytotoxicity (or
similar) assay using CD8+ T cells that recognize the epitope, in order to determine if the epitope is
processed and presented.
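
The shared-sequence check described in this example can be approximated locally once candidate protein sequences are in hand. The sketch below is not a substitute for a BLAST search: it performs only exact substring matching, and the protein names and sequences are invented placeholders (not real database entries) chosen so that the epitope WVLTAAHCI appears with different flanking residues.

```python
def find_exact_matches(epitope, proteins, flank=5):
    """Return (name, 1-based start, N-terminal flank, C-terminal flank)
    for every exact occurrence of the epitope (hypothetical helper)."""
    hits = []
    for name, sequence in proteins.items():
        start = sequence.find(epitope)
        while start != -1:
            end = start + len(epitope)
            hits.append((name,
                         start + 1,
                         sequence[max(0, start - flank):start],
                         sequence[end:end + flank]))
            start = sequence.find(epitope, start + 1)
    return hits


# Invented toy sequences, for illustration only.
toy_proteins = {
    "toy_protein_A": "MKGSAWVLTAAHCIRNKSV",
    "toy_protein_B": "QQPELWVLTAAHCISDGTY",
    "toy_protein_C": "MAAAAGGGGSSSS",
}

for hit in find_exact_matches("WVLTAAHCI", toy_proteins):
    print(hit)
```

Reporting the flanking residues alongside each hit mirrors the reasoning in the text: identical nine-residue segments embedded in different contexts may be processed differently by the proteasome.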
Example 16 
Epitope Clusters. 

Known and predicted epitopes are generally not evenly distributed across the sequences of
protein antigens. As referred to above, we have defined segments of sequence containing a higher
than average density of (known or predicted) epitopes as epitope clusters. Among the uses of
epitope clusters is the incorporation of their sequence into substrate peptides used in proteasomal
digestion analysis as described herein. Epitope clusters can also be useful as vaccine components.
A fuller discussion of the definition and uses of epitope clusters is found in U.S. Patent Application
No. 09/561,571 entitled EPITOPE CLUSTERS, previously incorporated by reference.

The following tables (37-60) present 9-mer epitopes predicted for HLA-A2 binding using
both the SYFPEITHI and NIH algorithms and the epitope density of regions of overlapping
epitopes, and of epitopes in the whole protein, and the ratio of these two densities. (The ratio must
exceed one for there to be a cluster by the above definition; requiring higher values of this ratio
reflects preferred embodiments.) Individual 9-mers are ranked by score and identified by the
position of their first amino acid in the complete protein sequence. Each potential cluster from a
protein is numbered. The range of amino acid positions within the complete sequence that the cluster
covers is indicated, as are the rankings of the individual predicted epitopes it is made up of.
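
The density ratio described above can be reproduced with a few lines of arithmetic. The sketch below assumes that cluster density is the number of predicted epitopes divided by the number of amino acids spanned by the cluster, and that whole-protein density is the total number of predicted epitopes divided by the protein length; the function name is hypothetical. The inputs are taken from the gp100 SYFPEITHI data in Table 39 (cluster 11: positions 397-421, 8 epitopes; 109 epitopes in 661 amino acids).

```python
def cluster_statistics(n_cluster_epitopes, first_aa, last_aa,
                       n_total_epitopes, protein_length):
    """Return (cluster density, whole-protein density, ratio), each density
    expressed as epitopes per amino acid (hypothetical helper)."""
    cluster_density = n_cluster_epitopes / (last_aa - first_aa + 1)
    whole_density = n_total_epitopes / protein_length
    return cluster_density, whole_density, cluster_density / whole_density


cluster, whole, ratio = cluster_statistics(8, 397, 421, 109, 661)
print(f"cluster {cluster:.3f}  whole protein {whole:.3f}  ratio {ratio:.3f}")

# The number of possible 9-mers in a protein is its length minus 8.
print("9-mers in a 661-residue protein:", 661 - 9 + 1)
```

A ratio above one marks the region as a cluster under the stated definition; a higher threshold can be applied when only the densest regions are wanted.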



Table 37

BIMAS-NIH/Parker algorithm Results for gp100

Rank   Start   Score          Rank   Start   Score
1      619     1493           21     416     19
2      602     413            22     25      18
3      162     [illegible]    23     566     17
4      18      118            24     603     15
5      178     118            25     384     14
6      273     117            26     13      14
7      601     81             27     290     12
8      243     63             28     637     10
9      606     60             29     639     9
10     373     50             30     485     9
11     544     36             31     453     8
12     291     29             32     102     8
13     592     29             33     399     8
14     268     29             34     456     7
15     47      27             35     113     7
16     585     26             36     622     7
17     576     21             37     69      7
18     465     21             38     604     6
19     570     20             39     350     6
20     9       19             40     583     5






Table 38

SYFPEITHI (Rammensee algorithm) Results for gp100

Rank   Start   Score     Rank   Start   Score     Rank   Start   Score
1      606     30        37     291     20        73     60      18
2      162     29        38     269     20        74     17      18
3      456     28        39     2       20        75     613     17
4      18      28        40     610     19        76     599     17
5      602     27        41     594     19        77     572     17
6      598     27        42     591     19        78     557     17
7      601     26        43     583     19        79     556     17
8      597     26        44     570     19        80     512     17
9      13      26        45     488     19        81     406     17
10     585     25        46     446     19        82     324     17
11     449     25        47     322     19        83     290     17
12     4       25        48     267     19        84     101     17
13     603     24        49     250     19        85     95      17
14     576     24        50     205     19        86     635     16
15     453     24        51     180     19        87     588     16
16     178     24        52     169     19        88     584     16
17     171     24        53     88      19        89     577     16
18     11      24        54     47      19        90     559     16
19     619     23        55     10      19        91     539     16
20     280     23        56     648     18        92     494     16
21     268     23        57     605     18        93     482     16
22     592     22        58     604     18        94     468     16
23     544     22        59     595     18        95     442     16
24     465     22        60     571     18        96     413     16
25     399     22        61     569     18        97     408     16
26     373     22        62     450     18        98     402     16
27     273     22        63     409     18        99     286     16
28     243     22        64     400     18        100    234     16
29     566     21        65     371     18        101    217     16
30     563     21        66     343     18        102    211     16
31     485     21        67     298     18        103    176     16
32     384     21        68     209     18        104    107     16
33     350     21        69     102     18        105    96      16
34     9       21        70     97      18        106    80      16
35     463     20        71     76      18        107    16      16
36     397     20        72     69      18        108    14      16
                                                  109    7       16
Table 39

Prediction of clusters for gp100

Total AAs: 661
Total 9-mers: 653
SYFPEITHI ≥16: 109 9-mers
NIH ≥5: 40 9-mers

                                                                            Epitopes/AA
Cluster #   AAs        Epitopes (by rank)                            Cluster   Whole Pr.   Ratio

SYFPEITHI
1           2 to 26    39, 12, 109, 34, 55, 18, 9, 108, 107, 74, 4   0.440     0.165       2.668
2           69-115     72, 71, 106, 53, 85, 105, 70, 84, 69, 104     0.213     0.165       1.290
3           95-115     85, 105, 70, 84, 69                           0.238     0.165       1.444
4           162-188    2, 52, 17, 103, 16, 51                        0.222     0.165       1.348
5           205-225    50, 68, 102, 101                              0.190     0.165       1.155
6           243-258    28, 49                                        0.125     0.165       0.758
7           267-306    48, 21, 38, 27, 20, 99, 83, 37, 67            0.225     0.165       1.364
8           322-332    47, 82                                        0.182     0.165       1.103
9           343-358    66, 33                                        0.125     0.165       0.758
10          371-381    65, 26                                        0.182     0.165       1.103
11          397-421    36, 25, 64, 98, 81, 97, 63, 96                0.320     0.165       1.941
12          442-476    95, 46, 11, 62, 15, 3, 35, 24, 94             0.257     0.165       1.559
13          482-502    93, 31, 45, 92                                0.190     0.165       1.155
14          539-552    91, 23                                        0.143     0.165       0.866
15          556-627    79, 78, 90, 30, 29, 61, 44, 60, 77, 14,       0.431     0.165       2.611
                       89, 43, 88, 10, 87, 42, 22, 41, 59, 8,
                       6, 76, 7, 5, 13, 58, 57, 1, 40, 75, 19

NIH
1           9 to 33    20, 26, 4, 22                                 0.160     0.061       2.644
2           268-281    14, 6                                         0.143     0.061       2.361
3           290-299    27, 12                                        0.200     0.061       3.305
4*          102-121    32, 35                                        0.100     0.061       1.653
5*          373-392    10, 25                                        0.100     0.061       1.653
6           453-473    31, 34, 18                                    0.143     0.061       2.361
7           566-600    23, 19, 17, 40, 16, 13                        0.171     0.061       2.833
8           601-614    7, 2, 24, 38, 9                               0.357     0.061       5.902
9           619-630    1, 36                                         0.17      0.061       2.754
10          637-647    28, 29                                        0.18      0.061       3.005

*Nearby but not overlapping epitopes
Table 40

BIMAS-NIH/Parker algorithm Results for PSMA

Rank   Start   Score
1      663     1360
2      711     1055
3      4       485
4      27      400
5      26      375
6      668     261
7      707     251
8      469     193
9      731     177
10     35      67
11     33      64
12     554     59
13     427     50
14     115     47
15     20      40
16     217     26
17     583     24
18     415     19
19     193     14
20     240     12
21     627     11
22     260     10
23     130     10
24     741     9
25     3       9
26     733     8
27     726     7
28     286     6
29     174     5
30     700     5






Table 41

SYFPEITHI (Rammensee algorithm) Results for PSMA

Rank   Start   Score     Rank   Start   Score     Rank   Start   Score
1      469     27        31     26      20        61     305     17
2      27      27        32     3       20        62     304     17
3      741     26        33     583     19        63     286     17
4      711     26        34     579     19        64     282     17
5      354     25        35     554     19        65     169     17
6      4       25        36     550     19        66     142     17
7      663     24        37     547     19        67     122     17
8      130     24        38     390     19        68     738     16
9      57      24        39     219     19        69     634     16
10     707     23        40     193     19        70     631     16
11     260     23        41     700     18        71     515     16
12     20      23        42     472     18        72     456     16
13     603     22        43     364     18        73     440     16
14     218     22        44     317     18        74     385     16
15     109     22        45     253     18        75     373     16
16     731     21        46     91      18        76     365     16
17     668     21        47     61      18        77     361     16
18     660     21        48     13      18        78     289     16
19     507     21        49     733     17        79     278     16
20     454     21        50     673     17        80     258     16
21     427     21        51     671     17        81     247     16
22     358     21        52     642     17        82     217     16
23     284     21        53     571     17        83     107     16
24     115     21        54     492     17        84     100     16
25     33      21        55     442     17        85     75      16
26     606     20        56     441     17        86     37      16
27     568     20        57     397     17        87     30      16
28     473     20        58     391     17        88     21      16
29     461     20        59     357     17
30     200     20        60     344     17
Table 42

Prediction of clusters for prostate-specific membrane antigen (PSMA)

Total AAs: 750
Total 9-mers: 742
SYFPEITHI ≥16: 88 9-mers
NIH ≥5: 30 9-mers

                                                                   Epitopes/AA
Cluster #   AAs           Epitopes (by rank)                 Cluster   Whole Pr.   Ratio

SYFPEITHI
1           3 to 12       32, 6                              0.200     0.117       1.705
2           13-45         48, 12, 88, 31, 2, 87, 25, 86      0.242     0.117       2.066
3           57-69         9, 47                              0.154     0.117       1.311
4           100-138       84, 83, 15, 24, 67, 8              0.154     0.117       1.311
5           [illegible]   40, 30                             0.111     0.117       0.947
6           217-227       82, 14, 39                         0.273     0.117       2.324
7           247-268       81, 45, 80, 11                     0.182     0.117       1.550
8           278-297       79, 64, 23, 63, 78                 0.250     0.117       2.131
9           354-381       5, 59, 22, 77, 43, 76, 75          0.250     0.117       2.131
10          385-405       74, 38, 58, 57                     0.190     0.117       1.623
11          440-450       73, 56, 55                         0.273     0.117       2.324
12          454-481       20, 72, 29, 1, 42, 28              0.214     0.117       1.826
13          507-523       19, 71                             0.118     0.117       1.003
14          547-562       37, 36, 35                         0.188     0.117       1.598
15          568-591       27, 53, 34, 33                     0.167     0.117       1.420
16          603-614       13, 26                             0.167     0.117       1.420
17          631-650       70, 69, 52                         0.150     0.117       1.278
18          660-681       18, 7, 17, 51, 50                  0.227     0.117       1.937
19          700-719       41, 10, 4                          0.150     0.117       1.278
20          731-749       16, 49, 68, 3                      0.211     0.117       1.794

NIH
1           3 to 12       25, 3                              0.200     0.040       5.000
2           20-43         15, 5, 4, 11, 10                   0.208     0.040       5.208
3*          415-435       18, 13                             0.095     0.040       2.381
4           663-676       1, 6                               0.143     0.040       3.571
5           700-715       30, 7, 3                           0.188     0.040       4.688
6           726-749       27, 9, 26, 24                      0.167     0.040       4.167

*Nearby but not overlapping epitopes
Table 43

BIMAS-NIH/Parker algorithm Results for PSA

Rank   Start   Score
1      7       607
2      170     243
3      52      124
4      53      112
5      195     101
6      165     23
7      72      18
8      245     18
9      2       16
10     59      16
11     122     15
12     125     15
13     191     13
14     9       8
15     14      6
16     175     5
17     130     5
Table 44

SYFPEITHI (Rammensee algorithm) Results for PSA

Rank   Start   Score
1      72      26
2      170     22
3      53      22
4      7       22
5      234     21
6      166     21
7      140     21
8      66      21
9      241     20
10     175     20
11     12      20
12     41      19
13     20      19
14     14      19
15     130     18
16     124     18
17     121     18
18     47      18
19     17      18
20     218     17
21     133     17
22     125     17
23     122     17
24     118     17
25     110     17
26     67      17
27     52      17
28     21      17
29     16      17
30     2       17
31     184     16
32     179     16
33     158     16
34     79      16
35     73      16
36     4       16
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Table 45 

Prediction of clusters for prostate specific antigen (PSA) 

Total AAs: 261 
Total 9-mers: 253 
SYFPEITHI >= 16: 36 9-mers
NIH >= 5: 17 9-mers

                                                              Epitopes/AA
Cluster #   AAs        Epitopes (by rank)                     Cluster   Whole Pr.   Ratio
SYFPEITHI
1           2 to 29    30, 36, 4, 11, 14, 29, 19, 13, 28      0.321     0.138       2.330
2           41-61      12, 18, 27, 3                          0.190     0.138       1.381
3           66-87      8, 26, 1, 35, 34                       0.227     0.138       1.648
4           110-148    25, 24, 17, 23, 16, 22, 15, 21, 7      0.184     0.138       1.332
5           158-192    33, 6, 2, 10, 32, 31                   0.171     0.138       1.243
6           234-249    5, 9                                   0.125     0.138       0.906
7*          118-133    24, 17, 23, 16, 22                     0.313     0.138       2.266
8*          118-138    24, 17, 23, 16, 22, 15                 0.286     0.138       2.071
NIH
1           2-22       9, 1, 14, 15                           0.190     0.065       2.924
2           52-67      3, 4, 10                               0.188     0.065       2.879
3           122-138    11, 12, 17                             0.176     0.065       2.709
4           165-183    6, 2, 16                               0.158     0.065       2.424
5           191-203    13, 5                                  0.154     0.065       2.362
6**         52-80      3, 4, 10, 7                            0.138     0.065       2.118

*These clusters are internal to the less preferred cluster #4.
**Includes a nearby but not overlapping epitope.
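
The three Epitopes/AA columns in the cluster tables follow from simple arithmetic: the cluster density is the number of predicted epitopes in the cluster divided by the cluster length, the whole-protein density is the number of 9-mers passing the score cutoff divided by the protein length, and the ratio is the first divided by the second. The following is a minimal sketch of that calculation, checked against SYFPEITHI cluster 1 of Table 45; the class and function names are illustrative assumptions and do not come from the specification.

```python
from dataclasses import dataclass


@dataclass
class Cluster:
    """Hypothetical container for one row of a cluster table."""
    first_aa: int    # first residue of the cluster (1-based, inclusive)
    last_aa: int     # last residue of the cluster (1-based, inclusive)
    n_epitopes: int  # predicted epitopes listed for the cluster


def cluster_density(cluster: Cluster) -> float:
    """Epitopes per amino acid within the cluster."""
    length = cluster.last_aa - cluster.first_aa + 1
    return cluster.n_epitopes / length


def whole_protein_density(total_epitopes: int, protein_length: int) -> float:
    """Epitopes per amino acid over the whole protein."""
    return total_epitopes / protein_length


def density_ratio(cluster: Cluster, total_epitopes: int, protein_length: int) -> float:
    """Cluster density relative to the whole-protein density."""
    return cluster_density(cluster) / whole_protein_density(total_epitopes, protein_length)


# PSA, SYFPEITHI cluster 1: residues 2 to 29, nine epitopes;
# 36 9-mers pass the cutoff in a 261-residue protein.
psa_cluster_1 = Cluster(first_aa=2, last_aa=29, n_epitopes=9)
print(f"{cluster_density(psa_cluster_1):.3f}")          # 0.321
print(f"{whole_protein_density(36, 261):.3f}")          # 0.138
print(f"{density_ratio(psa_cluster_1, 36, 261):.3f}")   # 2.330
```

A ratio above 1 indicates a region that is richer in predicted epitopes than the protein as a whole; a ratio below 1, as for SYFPEITHI cluster 6 of Table 45, indicates the opposite.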
Table 46

BIMAS-NIH/Parker algorithm Results for PSCA 





Rank    Start          Score
1       [illegible]    [illegible]
2       [illegible]    [illegible]
3       [illegible]    70
4       109            36
5       105            [illegible]
6       108            24
7       14             21
8       20             18
9       115            15
10      42             [illegible]
11      36             15
12      99             9
13      58             8



Table 47 

SYFPEITHI (Rammensee algorithm) Results for PSCA

Rank    Start    Score        Rank    Start    Score
1       108      30           17      54       19
2       14       30           18      12       19
3       105      29           19      4        19
4       5        28           20      1        19
5       115      26           21      112      18
6       99       26           22      101      18
7       7        26           23      98       18
8       109      24           24      51       18
9       53       23           25      43       18
10      107      21           26      106      17
11      20       21           27      104      17
12      8        21           28      83       17
13      13       20           29      63       17
14      102      19           30      50       17
15      60       19           31      3        17
16      57       19           32      9        16
                              33      92       16

Table 48

Prediction of clusters for prostate stem cell antigen (PSCA) 

Total AAs: 123
Total 9-mers: 115
SYFPEITHI >= 16: 33
SYFPEITHI >= 20: 13
NIH >= 5: 13

                                                                          Epitopes/AA
Cluster #   AAs        Epitopes (by rank)                                 Cluster   Whole Pr.   Ratio
SYFPEITHI >= 16
1           1 to 28    20, 31, 19, 4, 7, 12, 33, 18, 13, 2, 11            0.393     0.268       1.464
2           43-71      25, 30, 24, 9, 17, 16, 15, 29                      0.276     0.268       1.028
3           92-123     32, 23, 6, 27, 14, 22, 3, 26, 10, 1, 8, 21, 5      0.406     0.268       1.514
SYFPEITHI >= 20
1           5 to 28    4, 7, 12, 13, 2, 11                                0.250     0.106       2.365
2           99-123     6, 3, 10, 1, 8, 5                                  0.240     0.106       2.271
NIH
1           5 to 28    2, 3, 7, 8                                         0.167     0.106       1.577
2           36-51      11, 10, 1                                          0.188     0.106       1.774
3           99-123     12, 5, 6, 4, 9                                     0.200     0.106       1.892
4*          105-116    5, 6, 4                                            0.250     0.106       2.365

*This cluster is internal to the less preferred cluster #3.
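
The "Total 9-mers" figure in each table header is the number of 9-residue windows in the protein (protein length minus 8), and the ranked lists keep only windows whose score meets the stated cutoff. A minimal sketch of both steps follows. The function names are assumptions made for illustration, the toy sequence and toy scores are invented, and no actual scoring algorithm (SYFPEITHI or BIMAS-NIH/Parker) is implemented here.

```python
from typing import Iterable, Iterator, List, Tuple


def count_kmers(protein_length: int, k: int = 9) -> int:
    """Number of k-residue windows in a protein of the given length."""
    return max(protein_length - k + 1, 0)


def enumerate_kmers(sequence: str, k: int = 9) -> Iterator[Tuple[int, str]]:
    """Yield (1-based start position, peptide) for every k-mer window."""
    for i in range(len(sequence) - k + 1):
        yield i + 1, sequence[i:i + k]


def rank_candidates(scored: Iterable[Tuple[int, float]],
                    cutoff: float) -> List[Tuple[int, int, float]]:
    """Keep (start, score) pairs at or above the cutoff and rank them.

    Returns (rank, start, score) tuples, best score first; ties keep
    their input order because the sort is stable.
    """
    kept = sorted((s for s in scored if s[1] >= cutoff), key=lambda s: -s[1])
    return [(rank, start, score) for rank, (start, score) in enumerate(kept, 1)]


print(count_kmers(123))   # 115, as listed for PSCA
print(count_kmers(261))   # 253, as listed for PSA

# Toy 10-residue sequence (hypothetical): two 9-mer windows.
print(list(enumerate_kmers("ACDEFGHIKL")))

# Toy scores (hypothetical) filtered with a cutoff of 16.
print(rank_candidates([(3, 17), (10, 12), (21, 30), (40, 16)], cutoff=16))
```

The cutoff is applied as "greater than or equal to", which matches the tables: the lowest score listed in the SYFPEITHI tables is the cutoff value itself.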
In tables 49-60, epitope prediction and cluster analysis data for each algorithm are
presented together in a single table.
Table 49 

Prediction of clusters for MAGE-1 (NIH algorithm) 

Total AAs: 309 
Total 9-mers: 301 
NIH >= 5: 19 9-mers

                        Epitope   Start      NIH           Epitopes/AA
Cluster #   AAs         Rank      Position   Score     Cluster   Whole Pr.   Ratio
1           18-32       16        18         9         0.133     0.063       2.112
                        19        24         7
2           101-113     14        101        11        0.154     0.063       2.442
                        7         105        44
3           146-159     9         146        32        0.143     0.063       2.263
                        3         151        169
4           169-202     10        169        32        0.176     0.063       2.796
                        13        174        16
                        18        181        8
                        17        187        8
                        6         188        74
                        5         194        110
5           264-277     2         264        190       0.143     0.063       2.263
                        12        269        20
6           278-290     1         278        743       0.154     0.063       2.437
                        11        282        28
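
In the combined tables, the "AAs" range of each cluster is consistent with running from the first epitope's start position to the last residue of the last 9-mer, that is, the largest start position plus 8. The sketch below recomputes the range and the cluster density from the listed start positions. It is an observation drawn from the tabulated values, not a statement of how the clusters were originally selected, and the function names are illustrative assumptions.

```python
from typing import Sequence, Tuple


def cluster_span(starts: Sequence[int], k: int = 9) -> Tuple[int, int]:
    """First and last residue covered by k-mers beginning at the given starts."""
    return min(starts), max(starts) + k - 1


def epitopes_per_aa(starts: Sequence[int], k: int = 9) -> float:
    """Number of epitopes divided by the length of the span they cover."""
    first, last = cluster_span(starts, k)
    return len(starts) / (last - first + 1)


# MAGE-1, NIH algorithm (Table 49), clusters 1 and 4.
print(cluster_span([18, 24]))                              # (18, 32)
print(f"{epitopes_per_aa([18, 24]):.3f}")                  # 0.133
print(cluster_span([169, 174, 181, 187, 188, 194]))        # (169, 202)
print(f"{epitopes_per_aa([169, 174, 181, 187, 188, 194]):.3f}")  # 0.176
```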
Table 50

Prediction of clusters for MAGE-1 (SYFPEITHI algorithm) 



Total AAs: 309 
Total 9-mers: 301 
SYFPEITHI >= 16: 46 9-mers

                        Epitope   Start      SYFPEITHI       Epitopes/AA
Cluster #   AAs         Rank      Position   Score       Cluster   Whole Pr.   Ratio
1           7-49        22        7          19          0.233     0.153       1.522
                        9         15         22
                        27        18         18
                        16        20         20
                        28        22         18
                        29        24         18
                        33        31         17
                        30        35         18
                        2         38         26
                        17        41         20
2           89-132      10        89         22          0.273     0.153       1.783
                        18        92         20
                        7         93         23
                        23        96         19
                        43        98         16
                        4         101        25
                        8         105        23
                        34        107        17
                        35        108        17
                        36        113        17
                        37        118        17
                        19        124        20
3           167-203     44        167        16          0.270     0.153       1.766
                        20        169        20
                        12        174        21
                        24        181        19
                        6         187        24
                        31        188        18
                        25        191        19
                        38        192        17
                        1         194        27
                        13        195        21
4           230-246     14        230        21          0.118     0.153       0.769
                        39        238        17
5           264-297     15        264        21          0.235     0.153       1.538
                        32        269        18
                        40        270        17
                        26        271        19
                        46        275        16
                        3         278        26
                        21        282        20
                        41        289        17

Table 51

Prediction of clusters for MAGE-2 (NIH algorithm) 



Total AAs: 314
Total 9-mers: 308 
NIH >= 5: 20 9-mers 



Cluster # 


AAs 


Epitope 


Start 


Nln 












Rank 


Position 


score 


uiuster 


Whnln Pr 


Ratio 


1 


101-120 


18 


101 


5.373 


0.150 


0.065 








lO 


lUO 


6 756 












1 


112 


2800.697 








2 


153-167 


8 


153 


31.883 


0.200 


0.065 


3.080 






4 


158 


168.552 












7 


159 


32.138 








3 


169-211 


14 


169 


8.535 


0.209 


0.065 


3.223 






19 


174 


5.346 












6 


176 


49.993 












11 


181 


15.701 












15 


188 


7.536 












12 


195 


12.809 












5 


200 


88.783 












10 


201 


16.725 












17 


203 


5.609 








4 


271-284 


3 


271 


398.324 


0.143 


0.065 


2.200 






9 


276 


19.658 
Table 52

Prediction of clusters for MAGE-2 (SYFPEITHI algorithm)

Total AAs: 314
Total 9-mers: 308
SYFPEITHI >= 16: 52 9-mers



Cluster * 


AAs 


Emtooe 


Start 


SYFPEITHI 




EDitones/AA 






Rank 


Position 


Score 


Cluster 


Whole rr. 


Ratio 


1 


15-32 


13 


15 


21 


0.278 


0.169 


1.645 






29 


A ft 

18 


A O 

18 












A ft 

43 


oo 

20 


A ft 

16 












30 


22 


AO 

18 












71 


24 


A f\ 

19 








2 


37-56 


O f4 

31 


37 


A ft 

18 


A or* A 

0.250 


A A PA 

0.169 


1.481 






4 O 

16 


40 


AA 

20 












44 


44 


A ft 

16 












14 


45 


ftA 

21 












99 


4ft 


A ft 

19 _ 








3 


96-133 


36 


96 


17 


0.211 


0.169 


A ft) AT 

1.247 






A /•» 

46 


A ft A 

101 


A ft 

16 












6 


108 


ft& 
25 












47 


4 nn 

109 


A O 

16 












o 

2 


4 A ft 

112 


rvi 

27 












37 


A ft ft 

120 


17 












38 


A or* 

125 


17 












A f 

17 


A ft A 

.131 


OA 

20 








4 


153-216 


12 


153 


22 


f\ n a a 

0.344 


O 4 oo 

0.169 


o oo o 

2.036 | 






o a 

39 


158 


17 












mm 

7 


159 


25 












23 


At A J 

161 


19 












24 


162 


19 












48 


164 


16 












49 


a\ Mt\mm 

167 


A A 

16 












32 


170 


18 












50 


At J 

171 


16 












4 


174 


26 












9 


176 


o J 

24 












r" A 

51 


177 


16 












15 


A n A 

181 


21 












#% am> 

25 


188 


19 












A ft 

18 


A ft A 

194 


AA 

20 












33 


195 


18 












19 


198 


20 












3 


200 


27 












1 


201 


28 












40 


202 


17 












10 


203 


23 












52 


208 


16 








5 


237-254 


26 


237 


19 


0.167 


0.169 


0.987 






27 


245 


19 












34 




18 








6 


271-299 


8 


271 


25 


0.241 


0.169 


1.430 




* 


35 


276 


18 












41 


277 


17 


* 










11 


278 


23 












28 


283 


19 












20 


285 


20 












42 


291 


. 17 
Table 53

Prediction of clusters for MAGE-3 (NIH algorithm)

Total AAs: 314
Total 9-mers: 308
NIH >= 5: 22 9-mers



Cluster # 


AAs 


Epitope 


Start 


NIH 




Epitopes/ AA 








Rank 


Position 


Score 


Cluster 


Whole Pr. 


Ratio 


1 


101-120 


15 


101 


11.002 


0.200 


0.071 


2.800 






21 


105 


6.488 












8 


108 


49.134 














•110 










2 


153-167 


18 


153 


7.776 


0.200 


0.071 


2.800 






6 


158 


51.77 












22 


159 


5.599 








3 


174-209 


17 


174 


8.832 


0.194 


0.071 


2.722 






7 


176 


49.993 












13 


181 


15.701 












19 


188 


7.536 












14 


195 


12.809 












5 


200 


88.783 












12 


201 


16.725 








4 


237-251 


16 


237 


10.868 


0.200 


0.071 


2.800 






4 


238 


148.896 












20 


243 


6.88 








5 


271-284 


1 


271 


2655.495 


0.143 


0.071 


2.000 






11 


276 


19.658 
Table 54

Prediction of clusters for MAGE-3 (SYFPEITHI algorithm)

Total AAs: 314
Total 9-mers: 308
SYFPEITHI >= 16: 47 9-mers



rinetor ft 

LflUSlci If 


AAo 

MAS 


Luiiuuc 


Start 


SYFPEITHI 




EnifortP^/AA 

uuuuuwinM 


I 




Rank 


Position 


Score 


Cluster 


Whole Pr. 


Ratio I 


4 
1 


i* \o 


19 


1 w 


91 


0 278 


0 153 


1 890 






26 


18 


18 












37 


20 


16 












27 


22 


18 












18 


24 


19 








0 
t- 


00-00 




18 


16 


0 263 


0 153 


1725 






15 


40 


20 












39 


44 


16 












13 


45 


21 












19 


48 

-32 


19 








\ 
o 




98 


101 


18 


0 190 


0 153 


1 248 






40 


105 


16 












1 


108 


31 












A 


112 

lib 


25 












31 


120 


17 


• 


t 








32 


125 


17 














131 


20 












41 

-LI 


134 


16 








A 

H 






151 


1Q 




0 153 


2 048 






29 


156 


18 












?? 

00 


158 


17 












21 


159 


19 












OH 


181 


17 












42 


164 


16 














ID/ 


18 
10 












10 


174 


22 












ft 

u 


176 


23 












14 


181 


21 














188 


19 












44 


193 


16 












11 


194 


22 












23 


195 


19 












45 


197 


16 












17 


198 


20 












3 


200 


27 












2 


201 


28 












35 


202 


17 












46 


208 


16 








5 


220-230 


5 


220 


26 


0.182 


0.153 


1.191 






47 


222 


16 








6 


237-246 


7 


237 


25 


0.200 


0.153 


1.311 






9 


238 


23 








7 


271-293 


4 


271 


27 


0.217 


0.153 


1.425 






30 


276 


18 












24 


278 


19 












36 


283 


17 












25 


285 


19 
Table 55

Prediction of clusters for PRAME (NIH algorithm)

Total AAs: 509
Total 9-mers: 501
NIH >= 5: 40 9-mers



Cluster # 


AAs 


Epitope 
Rank 


Start 
Position 


MIU 

Nln 

Score 


Cluster 


Cn'if nriCkclAA 
CpilOpca/Mr\ 

Whole Pr. 


Ratio 


1 


33-47 


20 
17 


33 
39 


18 
21 


0.133 


0.080 


1.670 


2 


71-81 


9 

32 


71 
73 


50 
7 


0.2 


0.07984 


Z.OVO 


3 


99-108 


23 
24 


100 
99 


15 
13 


0.2 


0.07984 


2.505 


4 


126-135 


38 
35 


126 
127 


5 
6 


0.2 


0.07984 


2.505 


5 


224-246 


5 
8 
39 


224 
230 
238 


124 
63 
5 


0.130 


0.080 


1.634 


6 


290-303 


18 
14 
7 


290 
292 
295 


18 

23 
66 


0.214 


0.080 


2.684 


7 


305-324 


28 

oU 

25 
36 


305 
ouo 
312 
316 


10 

Q 
O 

13 
6 


0.200 


0.080 


2.505 


8 


394-409 


2 
12 
31 


394 
397 
401 


182 
42 
7 


0.188 


0.080 


2.348 


9 


422-443 


10 
3 
34 
29 
4 


422 
425 
431 
432 
435 


49 
182 

7 

9 

160 


0.227 


0.080 


2.847 


10 


459-487 


15 
11 
22 
40 
37 


459 
462 
466 
472 
479 


21 

45 

15 

5 

6 


0.172 


0.080 


2.159 
Table 56

Prediction of clusters for PRAME (SYFPEITHI algorithm)

Total AAs: 509
Total 9-mers: 501
SYFPEITHI >= 17: 80 9-mers





A Ac 
MAS 


epitope 


Olall 


QYFPPITHI 




F nitnne<%/A A 








KanK 


Dnc 14* 1 rvn 

r*05inon 


OL.UIB 




Whole Pr 


Ratio 


1 


4 A r* a 

18-59 


65 


18 


47 
If 


n ooo 
O.ZOO 


A 4CA 

U.10U 


1 AQ1 

i.4y i 






CA 

50 


21 


lo 












ec 

66 


oc 
26 


17 
If 












OC 

35 


00 


OA 

zu 












22 


Oil 

34 


oo 
Zz 












51 


37 


40 












5 


on 

oy 


07 
Z/ 












23 


4 A 

40 


OO 

zz 












lo 


AA 
44 


Z4 












40 


C1 
01 


10 








Z 




00 


10 


on 

zu 


U.^DO 


fl 1fi0 


1 64£ 






fi7 




17 












c.0 
0Z 


P.A 
04 


1ft 
10 












OA 
Z4 


ftp. 
00 


00 












00 


01 


10 












OK 

ZD 


yo 


00 

zz 












A 

9 


OA 

yy 


OR 












Q 

0 


4 aa 
TOO 


Oft 
ZD 












04 


4 no 


•1ft 

10 












00 


4 A.7 


4ft 
10 








A 

3 


4f\ 4 aaa 

191-202 


56 


4 OH 

191 


40 
10 


f\ 1R7 


U. ICU 


1 OAA ' 
1.1/44 






00 


ly4 


on 
zu 








4 


205-21 o 


ZD 


OAC 

ZU0 


oo 
zz 


ft 1P.0 
U.lOZ 


U. I0U 


1. 1 oy 






27 


0A7 

zu/ 


00 

zz 








5 


AAA AAA 

222-238 


47 


AAA 

222 


4 A 

19 


0.235 


A 4CA 


1 A7A 

1.4/4 






14 


A A A 

224 


Ail 

24 












A A 

69 


A AT 

227 


4 "7 

17 












57 


AAA 

230 


4 A 

18 








6 


241-273 


70 


241 


17 


0.212 


A 4 £f\ 

0.160 


4 OOO 

1.328 






is 

10 


OAfl 














71 


255 


17 












30 


258 


21 












39 


259 


20 












58 


261 


18 












40 


265 


20 








7 


290-342 


72 


290 


17 


0.208 


0.160 


1.300 






48 


293 


19 












31 


298 


21 












73 


301 


17 












18 


305 


23 












6 


308 


27 












10 


312 


25 












19 


316 


23 












28 


319 


22 






g A 

41 


326 


OA 










74 


334 


17 






8 


343-363 


59 


343 


A A 

18 


0.238 U.lbU 


1. 43 I 






60 


348 


A 6 

18 










75 


r\ i- A 

351 


17 










20 


4*k f /V 

353 


23 










76 


355 


17 






9 


364-447 


49 


364 


19 


l\ OCA A A CA 

0.250 0.1 DO 


1.000 






32 


371 


fsA 

21 










11 


A A 

372 


25 










61 


375 


A ft 

18 










77 


A A Af% 

382 


A "9 

17 










21 


4fc JL rt 

390 


AA 

23 










78 


391 


17 










1 


394 


30 










42 


397 


aA 

20 










62 


403 


A D 










33 


410 


21 










A A 

43 


418 


20 












419 


21 








• 


7 


422 


27 










2 


425 


29 










79 


426 


17 










63 


428 


18 










64 


431 


18 










12 


432 


25 










16 


435 


24 










80 


439 


17 






10 


455-474 


29 


455 


22 


0.200 0.160 


1.253 






17 


459 


24 










4 


462 


28 










3 


466 


29 
Table 57

Prediction of clusters for CEA (NIH algorithm)

Total AAs: 702
Total 9-mers: 694
NIH >= 5: 30 9-mers



Cluster # 


AA 


Peptides 
Rank 


Start 
Position 


Score 


Cluster 


Peptides/ aas 
Whole Pr. 


Ratio 


1 


17-32 


5 
7 

20 


17 
18 
24 


79.041 
46.873 
12.668 


0.188 


0.043 


4.388 


2 


113-129 


2 
15 


no 

121 


lor .yy i 
21.362 


0 118 


0 043 


2.753 


3 


172-187 


25 
14 


172 
179 


9.165 
27.995 


0.125 


0.043 


2.925 


4 


278-291 


30 
17 


278 
283 


5.818 
19.301 


0.143 


0.043 


3.343 


5 


350-365 


9 
12 


350 
357 


43.075 
27.995 


0125 


0i043 


2.925 


6 


528-543 


8 
13 


528 
535 


43.075 
27.995 


0.125 


0.043 


2.925 


7 


631-645 


23 
19 
24 


631 
634 
637 


9.563 
13.381 
9.245 


0.200 


0.043 


4.680 


8 


691-702 

i 


1 

27 


691 
694 


196.407 
7.769 


0.167 


0.043 


3.900 
Table 58

Prediction of clusters for CEA (SYFPEITHI algorithm)

Total AAs: 702
Total 9-mers: 694
SYFPEITHI >= 16: 81 9-mers



Cluster # 


AA 


Peptides 


Start 


Score 




Peptides/AAs 








Rank 


Position 




Cluster 


Whole Pr. 


Ratio 


1 

i 


5-36 


67 


5 


16 


0.250 


0.117 


2.140 






23 

^^^^ 


12 


19 












24 


16 


19 












9 


17 


22 












25 


18 


19 












32 


19 


18 












68 


23 


16 












33 


28 


18 








2 


37-62 


41 


37 


17 


0.269 


0.117 


2.305 






20 


44 


20 












26 


45 . 


19 












42 


46 


17 












27 


50 


19 












43 


53 


17 












44 


54 


17 








3 


99-115 


14 


99 


21 


0.235 


0.117 


2.014 






5 


100 


23 












45 


104 


17 












34 


107 


18 








4 


116-129 


69 


116 


16 


0.143 


0.117 


1.223 






21 


121 


20 








5 


172-187 


46 


172 


17 


0.125 


0.117 


1.070 






70 


179 


16 








6 


192-202 


3 


192 


24 


0.182 


0.117 


1.557 






47 


194 


17 




* 




7 


226-241 


48 


226 


17 


0.188 


0.117 


1.605 






49 


229 


17 












15 


233 


21 








8 


307-318 


11 


307 


22 


0.250 


0.117 


2.140 






71 


308 


16 












51 


310 


17 








9 


319-349 


52 


319 


17 


0.129 


0.117 


1.105 






53 


327 


17 












72 


335 


16 












35 


341 


18 








10 


370-388 


12 


370 


22 


0.211 


0.117 


1.802 






54 


372 


17 












74 


375 


16 












6 


380 


23 








11 


403-419 


56 


403 


17 


0.235 


0.117 


2.014 






57 


404 


17 












58 


407 


17 












28 


411 


19 
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12 


427-442 


59 


427 


17 


0.188 


0.117 


1.605 






75 


432 


16 












76 


434 


16 








13 


450-462 


77 


450 


16 


0.154 


0.117 


1.317 


* 




13 


454 


22 








14 


488-505 


36 


488 


18 


0.167 


0.117 


1.427 






18 


492 


21 












60 


497 


17 








15 


548-558 


4 


548 


24 


0.182 


0.117 


1.557 






61 


550 


17 








16 


565-577 


62 


565 


17 


0.154 


0.117 


1.317 






19 


569 


21 








17 


579-597 


78 


579 


16 


0.143 


0.117 


1.223 






79 


582 


16 












7 


589 


23 








18 


605-618 


2 


605 


25 


0.143 


0.117 


1.223 






38 


610 


18 








19 


631-669 


29 


631 


19 


0.154 


0.117 


1.317 


4 




63 


637 


17 












80 


644 


16 












64 


652 


17 












39 


660 


18 






r 






81 


661 


16 








20 


675-702 


22 


675 


20 


0.286 


0.117 


2.446 






30 


683 


19 












31 


687 


19 












40 


688 


18 












65 


690 


17 












1 


691 


31 












66 


692 


17 












8 


694 


23 









» 
Table 59

Prediction of clusters for SCP-1 (NIH algorithm)

Total AAs: 976
Total 9-mers: 968
NIH >= 5: 37 9-mers



Cluster # 


AA 


Peptides 
Rank 


Start 
Position 


Score 


Cluster 


Peptides/AAs 
Whole Pr. 


Ratio 


1 


101-116 


15 
13 


101 
108 


40.589 
57.255 


0.125 


0.038 


3.2/0 


2* 


281-305 


14 
24 
17 


281 
288 
297 


44.944 
15.203 
32.857 


0.12 


0.038 


3.139 


3 


431-447 


8 

26 
4 


431 
438 
439 


80.217 
1 1 .861 
148.896 


0.073 


0.038 


1.914 


4 


557-579 


11 
19 
6 
18 


557 
560 
564 
571 


64.335 
24.937 
87.586 
32.765 


0.174 


0.038 


4.550 


5 


635-650 


10 
34 


635 
642 


69.552 
6.542 


0.125 


0.038 


3.270 


6 


755-767 


36 
35 


755 
759 


5.599 
5.928 


0.154 


0.038 


4.025 


7 


838-854 


2 

28 


836 
846 


284.517 
11.426 


0.118 


0.038 


3.078 
Table 60

Prediction of clusters for SCP-1

Total AAs: 976
Total 9-mers: 968
Rammensee >= 16: 118 9-mers




i 
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1 1 376-447 



54 
43 
44 
20 
55 
6 

86 
45 
67 
21 
46 
68 
32 



376 
382 
386 
390 
397 
404 
407 
411 
417 
425 
431 
432 
438 



19 
20 
20 
22 
19 
24 
17 
20 
18 
22 
20 
18 
21 



0.194 



0.121 



1.608 



I 12 


455-488 


33 


455 


21 


0.235 


0.121 


1 .946 j 




t 


47 


459 


20 












56 


462 


19 












87 


463 


17 












88 


466 


17 












14 


470 


23 












109 


473 


16 












34 


480 


21 










515-530 


~ 57 


515 


19 


0.125 


0.121 


' A AO/1 I 






22 


522 


22 








r 14 


557-590 


8 


557 


24 


0.147 


0.121 


1.216 I 






23 


564 


22 












9 


571 


24 












90 


575 


17 












58 


582 


19 








15 


610-625 


69 


610 


18 






1 034 I 






91 


617 


17 








I ™~ 


633-668 


92 


633 


17 


0.222 










10 


635 


24 












70 


638 


18 












93 


640 


17 












48 


642 


20 












49 


645 


20 












111 


652 


16 












112 


660 


16 










674-685 


71 


674 


18 


0.167 


oizi 


1.379 




11 


677 


24 








18 


687-702 


1 


687 


26 


0.125 


0.121 


1.034 ~] 






94 


694 


17 








r 19 


744-767 


113 


744 


16 


0.250 


0.121 


2.068 




95 


745 


17 






* n 






4 


745 


25 












24 


752 


22 












2 


755 


26 












72 


759 


18 








20 


812-827 


97 


812 


17 


0.125 


0.121 


1 .034 






115 


819 


16 









21 838-857 



0.150 



25 
74 



846 
849 



22 
18 



4 
  1 MLLAVLYCLL WSFQTSAGHF PRACVSSKNL MEKECCPPWS GDRSPCGQLS GRGSCQNILL
 61 SNAPLGPQFP FTGVDDRESW PSVFYNRTCQ CSGNFMGFNC GNCKFGFWGP NCTERRLLVR
121 RNIFDLSAPE KDKFFAYLTL AKHTISSDYV IPIGTYGQMK NGSTPMFNDI NIYDLFVWMH
181 YYVSMDALLG GSEIWRDIDF AHEAPAFLPW HRLFLLRWEQ EIQKLTGDEN FTIPYWDWRD
241 AEKCDICTDE YMGGQHPTNP NLLSPASFFS SWQIVCSRLE EYNSHQSLCN GTPEGPLRRN
301 PGNHDKSRTP RLPSSADVEF CLSLTQYESG SMDKAANFSF RNTLEGFASP LTGIADASQS
361 SMHNALHIYM NGTMSQVQGS ANDPIFLLHH AFVDSIFEQW LRRHRPLQEV YPEANAPIGH
421 NRESYMVPFI PLYRNGDFFI SSKDLGYDYS YLQDSDPDSF QDYIKSYLEQ ASRIWSWLLG
481 AAMVGAVLTA LLAGLVSLLC RHKRKQLPEE KQPLLMEKED YHSLYQSHL

TYROSINASE PROTEIN

  1 MNGDDAFARR PTVGAQIPEK IQKAFDDIAK YFSKEEWEKM KASEKIFYVY MKRKYEAMTK
 61 LGFKATLPPF MCNKRAEDFQ GNDLDNDPNR GNQVERPQMT FGRLQGISPK IMPKKPAEEG
121 NDSEEVPEAS GPQKDGKELC PPGKPTTSEK ISERSGPKRG EHAWTHRLRE RKQLVIYEEI
181 SDPEEDDE

SSX-2 PROTEIN
  1 MWNLLHETDS AVATARRPRW LCAGALVLAG GFFLLGFLFG WFIKSSNEAT NITPKHNMKA
 61 FLDELKAENI KKFLYNFTQI PHLAGTEQNF QLAKQIQSQW KEFGLDSVEL AHYDVLLSYP
121 NKTHPNYISI INEDGNEIFN TSLFEPPPPG YENVSDIVPP FSAFSPQGMP EGDLVYVNYA
181 RTEDFFKLER DMKINCSGKI VIARYGKVFR GNKVKNAQLA GAKGVILYSD PADYFAPGVK
241 SYPDGWNLPG GGVQRGNILN LNGAGDPLTP GYPANEYAYR RGIAEAVGLP SIPVHPIGYY
301 DAQKLLEKMG GSAPPDSSWR GSLKVPYNVG PGFTGNFSTQ KVKMHIHSTN EVTRIYNVIG
361 TLRGAVEPDR YVILGGHRDS WVFGGIDPQS GAAVVHEIVR SFGTLKKEGW RPRRTILFAS
421 WDAEEFGLLG STEWAEENSR LLQERGVAYI NADSSIEGNY TLRVDCTPLM YSLVHNLTKE
481 LKSPDEGFEG KSLYESWTKK SPSPEFSGMP RISKLGSGND FEVFFQRLGI ASGRARYTKN
541 WETNKFSGYP LYHSVYETYE LVEKFYDPMF KYHLTVAQVR GGMVFELANS IVLPFDCRDY
601 AVVLRKYADK IYSISMKHPQ EMKTYSVSFD SLFSAVKNFT EIASKFSERL QDFDKSNPIV
661 LRMMNDQLMF LERAFIDPLG LPDRPFYRHV IYAPSSHNKY AGESFPGIYD ALFDIESKVD
721 PSKAWGEVKR QIYVAAFTVQ AAAETLSEVA

PSMA PROTEIN

1: NM_000372 Homo sapiens tyrosinase (oculocutaneous albinism IA) (TYR), mRNA

LOCUS       NM_000372    1964 bp    mRNA    PRI    31-OCT-2000
DEFINITION  Homo sapiens tyrosinase (oculocutaneous albinism IA) (TYR), mRNA.
ACCESSION   NM_000372
VERSION     NM_000372.1  GI:4507752
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

REFERENCE   1  (bases 1 to 1964)
  AUTHORS   Kwon BS, Haq AK, Pomerantz SH and Halaban R.
  TITLE     Isolation and sequence of a cDNA clone for human tyrosinase that
            maps at the mouse c-albino locus
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (21), 7473-7477 (1987)
  MEDLINE   88041128
   PUBMED   2823263
  REMARK    Erratum:[[published erratum appears in Proc Natl Acad Sci USA
            1988 Sep;85(17):6352]]
REFERENCE   2  (bases 1 to 1964)
  AUTHORS   Barton DE, Kwon BS and Francke U.
  TITLE     Human tyrosinase gene, mapped to chromosome 11 (q14-q21),
            defines second region of homology with mouse chromosome 7
  JOURNAL   Genomics 3 (1), 17-24 (1988)
  MEDLINE   89122007
   PUBMED   3146546
REFERENCE   3  (bases 181 to 1964)
  AUTHORS   Shibahara,S., Tomita,Y., Tagami,H., Muller,R.M. and Cohen,T.
  TITLE     Molecular basis for the heterogeneity of human tyrosinase
  JOURNAL   Tohoku J. Exp. Med. 156 (4), 403-414 (1988)
  MEDLINE   89222868
REFERENCE   4  (bases 1 to 1964)
  AUTHORS   Bouchard B, Fuller BB, Vijayasaradhi S and Houghton AN.
  TITLE     Induction of pigmentation in mouse fibroblasts by expression of
            human tyrosinase cDNA
  JOURNAL   J. Exp. Med. 169 (6), 2029-2042 (1989)
  MEDLINE   89279151
   PUBMED   2499655
REFERENCE   5  (bases 1 to 1964)
  AUTHORS   Takeda,A., Tomita,Y., Okinaga,S., Tagami,H. and Shibahara,S.
  TITLE     Functional analysis of the cDNA encoding human tyrosinase
            precursor
  JOURNAL   Biochem. Biophys. Res. Commun. 162 (3), 984-990 (1989)
  MEDLINE   89351001
REFERENCE   6  (bases 1 to 1964)
  AUTHORS   Kikuchi H, Miura H, Yamamoto H, Takeuchi T, Dei T and Watanabe M.
  TITLE     Characteristic sequences in the upstream region of the human
            tyrosinase gene
  JOURNAL   Biochim. Biophys. Acta 1009 (3), 283-286 (1989)
  MEDLINE   90089403
   PUBMED   2480811
REFERENCE   7  (bases 1 to 1964)
  AUTHORS   Giebel LB, Strunk KM and Spritz RA.
  TITLE     Organization and nucleotide sequences of the human tyrosinase
            gene and a truncated tyrosinase-related segment
  JOURNAL   Genomics 9 (3), 435-445 (1991)
  MEDLINE   91236163
   PUBMED   1903356
REFERENCE   8  (bases 1 to 1964)
  AUTHORS   Brichard V, Van Pel A, Wolfel T, Wolfel C, De Plaen E, Lethe B,
            Coulie P and Boon T.
  TITLE     The tyrosinase gene codes for an antigen recognized by autologous
            cytolytic T lymphocytes on HLA-A2 melanomas
  JOURNAL   J. Exp. Med. 178 (2), 489-495 (1993)
  MEDLINE   93340625
   PUBMED   8340755
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from M27160.1.

FEATURES             Location/Qualifiers
     source          1..1964
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="11"
                     /map="11q14-q21"
                     /cell_line="MeWo melanoma DNA, and cDNA to TORJtJA"
                     /tissue_type="placenta"
     gene            1..1964
                     /gene="TYR"
                     /note="OCAIA"
                     /db_xref="LocusID:7299"
                     /db_xref="MIM:203100"
     CDS             83..1672
                     /gene="TYR"
                     /EC_number="1.14.18.1"
                     /note="Tyrosinase"
                     /codon_start=1
                     /db_xref="LocusID:7299"
                     /db_xref="MIM:203100"
                     /product="tyrosinase (oculocutaneous albinism IA)"
                     /protein_id="NP_000363.1"
                     /db_xref="GI:4507753"
                     /translation="MLLAVLYCLLWSFQTSAGHFPRACVSSKNLMEKECCPPWSGDRS
                     PCGQLSGRGSCQNILLSNAPLGPQFPFTGVDDRESWPSVFYNRTCQCSGNFMGFNCGN
                     CKFGFWGPNCTERRLLVRRNIFDLSAPEKDKFFAYLTLAKHTISSDYVIPIGTYGQMK
                     NGSTPMFNDINIYDLFVWMHYYVSMDALLGGSEIWRDIDFAHEAPAFLPWHRLFLLRW
                     EQEIQKLTGDENFTIPYWDWRDAEKCDICTDEYMGGQHPTNPNLLSPASFFSSWQIVC
                     SRLEEYNSHQSLCNGTPEGPLRRNPGNHDKSRTPRLPSSADVEFCLSLTQYESGSMDK
                     AANFSFRNTLEGFASPLTGIADASQSSMHNALHIYMNGTMSQVQGSANDPIFLLHHAF
                     VDSIFEQWLRRHRPLQEVYPEANAPIGHNRESYMVPFIPLYRNGDFFISSKDLGYDYS
                     YLQDSDPDSFQDYIKSYLEQASRIWSWLLGAAMVGAVLTALLAGLVSLLCRHKRKQLP
                     EEKQPLLMEKEDYHSLYQSHL"
     sig_peptide     83..136
     mat_peptide     137..1669
                     /note="tyrosinase"
                     /product="tyrosinase (oculocutaneous albinism IA)"
     misc_feature    560..1429
                     /note="tyrosinase; Region: Common central domain of
                     tyrosinase"

BASE COUNT      520 a    462 c    445 g    537 t
ORIGIN

        1 atcactgtag tagtagctgg aaagagaaat ctgtgactcc aattagccag ttcctgcaga
       61 ccttgtgagg actagaggaa gaatgctcct ggctgttttg tactgcctgc tgtggagttt
      121 ccagacctcc gctggccatt tccctagagc ctgtgtctcc tctaagaacc tgatggagaa
      181 ggaatgctgt ccaccgtgga gcggggacag gagtccctgt ggccagcttt caggcagagg
      241 ttcctgtcag aatatccttc tgtccaatgc accacttggg cctcaatttc ccttcacagg
      301 ggtggatgac cgggagtcgt ggccttccgt cttttataat aggacctgcc agtgctctgg
      361 caacttcatg ggattcaact gtggaaactg caagtttggc ttttggggac caaactgcac
      421 agagagacga ctcttggtga gaagaaacat cttcgatttg agtgccccag agaaggacaa
      481 attttttgcc tacctcactt tagcaaagca taccatcagc tcagactatg tcatccccat
      541 agggacctat ggccaaatga aaaatggatc aacacccatg tttaacgaca tcaatattta
      601 tgacctcttt gtctggatgc attattatgt gtcaatggat gcactgcttg ggggatctga
      661 aatctggaga gacattgatt ttgcccatga agcaccagct tttctgcctt ggcatagact
      721 cttcttgttg cggtgggaac aagaaatcca gaagctgaca ggagatgaaa acttcactat
      781 tccatattgg gactggcggg atgcagaaaa gtgtgacatt tgcacagatg agtacatggg
      841 aggtcagcac cccacaaatc ctaacttact cagcccagca tcattcttct cctcttggca
      901 gattgtctgt agccgattgg aggagtacaa cagccatcag tctttatgca atggaacgcc
      961 cgagggacct ttacggcgta atcctggaaa ccatgacaaa tccagaaccc caaggctccc
     1021 ctcttcagct gatgtagaat tttgcctgag tttgacccaa tatgaatctg gttccatgga
     1081 taaagctgcc aatttcagct ttagaaatac actggaagga tttgctagtc cacttactgg
     1141 gatagcggat gcctctcaaa gcagcatgca caatgccttg cacatctata t9aat
     1201 aatgtcccag gtacagggat ctgccaacga tcctatcttc cttcttcacc atgcatttgt
     1261 tgacagtatt tttgagcagt ggctccgaag gcaccgtcct cttcaagaag tttatccaga
     1321 agccaatgca cccattggac ataaccggga atcctacatg gttcctttxa taccactgta
     1381 cagaaatggt gatttcttta tttcatccaa agatctgggc tatgactata gctatctaca
     1441 agattcagac ccagactctt ttcaagacta cattaagtcc tatttggaac aagcgagtcg
     1501 gatctggtca tggctccttg gggcggcgat ggtaggggcc gtcctcactg ccctgctggc
     1561 agggcttgtg agcttgctgt gtcgtcacaa gagaaagcag cttcctgaag aaaagcagcc
     1621 actcctcatg gagaaagagg attaccacag cttgtatcag agccatttat aaaaggctta
     1681 ggcaatagag tagggccaaa aagcctgacc tcactctaac tcaaagtaat gtccaggttc
     1741 ccagagaata tctgctggta tttttctgta aagaccattt gcaaaattgt aacctaatac
     1801 aaagtgtagc cttfcttccaa ctcaggtaga acacacctgt ctttgtcttg ctgttttcac
     1861 tcagcccttt taacattttc ccctaagccc atatgtctaa ggaaaggatg ctatttggta
     1921 atgaggaact gttatttgta tgtgaattaa agtgctctta tttt



NM_003147. Homo sapiens synovial sarcoma, X breakpoint 2 (SSX2), mRNA

LOCUS       NM_003147     766 bp    mRNA            PRI       14-MAR-2001
DEFINITION  Homo sapiens synovial sarcoma, X breakpoint 2 (SSX2), mRNA.
ACCESSION   NM_003147
VERSION     NM_003147.1  GI:10337582
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 766)
  AUTHORS   Shipley JM, Clark J, Crew AJ, Birdsall S, Rocques PJ, Gill S,
            Chelly J, Monaco AP, Abe S, Gusterson BA and et al.
  TITLE     The t(X;18)(p11.2;q11.2) translocation found in human synovial
            sarcomas involves two distinct loci on the X chromosome
  JOURNAL   Oncogene 9 (5), 1447-1453 (1994)
  MEDLINE   94203675
  PUBMED    8152806
REFERENCE   2  (bases 1 to 766)
  AUTHORS   Crew,A.J., Clark,J., Fisher,C., Gill,S., Grimer,R., Chand,A.,
            Shipley,J., Gusterson,B.A. and Cooper,C.S.
  TITLE     Fusion of SYT to two genes, SSX1 and SSX2, encoding proteins with
            homology to the Kruppel-associated box in human synovial sarcoma
  JOURNAL   EMBO J. 14 (10), 2333-2340 (1995)
  MEDLINE   95292974
REFERENCE   3  (bases 1 to 766)
  AUTHORS   Tureci O, Sahin U, Schobert I, Koslowski M, Schmitt H, Schild HJ,
            Stenner F, Seitz G, Rammensee HG and Pfreundschuh M.
  TITLE     The SSX-2 gene, which is involved in the t(X;18) translocation of
            synovial sarcomas, codes for the human tumor antigen HOM-MEL-40
  JOURNAL   Cancer Res. 56 (20), 4766-4772 (1996)
  MEDLINE   96438636
  PUBMED    8840996
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from X86175.1.
FEATURES             Location/Qualifiers
     source          1..766
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="X"
                     /map="Xp11.23-p11.22"
                     /dev_stage="adult"
     gene            1..766
                     /gene="SSX2"
                     /note="HD21; HOM-MEL-40; SSX"
                     /db_xref="LocusID:6757"
                     /db_xref="MIM:300192"
     misc_feature    20..61
                     /note="Kruppel associated box homology"
     CDS             92..658
                     /gene="SSX2"
                     /codon_start=1
                     /db_xref="LocusID:6757"
                     /db_xref="MIM:300192"
                     /product="synovial sarcoma, X breakpoint 2"
                     /protein_id="NP_003138.1"
                     /db_xref="GI:10337583"
                     /translation="MNGDDAFARRPTVGAQIPEKIQKAFDDIAKYFSKEEWEKMKASE
                     KIFYVYMKRKYEAMTKLGFKATLPPFMCNKRAEDFQGNDLDNDPNRGNQVERPQMTFG
                     RLQGISPKIMPKKPAEEGNDSEEVPEASGPQNDGKELCPPGKPTTSEKIHERSGPKRG
                     EHAWTHRLRERKQLVIYEEISDPEEDDE"
     misc_feature    161..337
                     /note="KRAB; Region: krueppel associated box"
BASE COUNT      229 a    181 c    200 g    156 t
ORIGIN

        1 ctctctttcg attcttccat actcagagta cgcacggtct gattttctct ttggattctt
       61 ccaaaatcag agtcagactg ctcccggtgc catgaacgga gacgacgcct ttgcaaggag
      121 acccacggtt ggtgctcaaa taccagagaa gatccaaaag gccttcgatg atattgccaa
      181 atacttctct aaggaagagt gggaaaagat gaaagcctcg gagaaaatct tctatgtgta
      241 tatgaagaga aagtatgagg ctatgactaa actaggtttc aaggccaccc tcccaccttt
      301 catgtgtaat aaacgggccg aagacttcca ggggaatgat ttggataatg accctaaccg
      361 tgggaatcag gttgaacgtc ctcagatgac tttcggcagg ctccagggaa tctccccgaa
      421 gatcatgccc aagaagccag cagaggaagg aaatgattcg gaggaagtgc cagaagcatc
      481 tggcccacaa aatgatggga aagagctgtg ccccccggga aaaccaacta cctctgagaa
      541 gattcacgag agatctggac ccaaaagggg ggaacatgcc tggacccaca gactgcgtga
      601 gagaaaacag ctggtgattt atgaagagat cagcgaccct gaggaagatg acgagtaact
      661 cccctcaggg atacgacaca tgcccatgat gagaagcaga acgtggtgac ctttcacgaa
      721 catgggcatg gctgcggacc cctcgtcatc aggtgcatag caagtg



NM_004476. Homo sapiens folate hydrolase (prostate-specific membrane antigen) 1 (FOLH1), mRNA

LOCUS       NM_004476    2653 bp    mRNA            PRI       01-NOV-2000
DEFINITION  Homo sapiens folate hydrolase (prostate-specific membrane antigen)
            1 (FOLH1), mRNA.
ACCESSION   NM_004476
VERSION     NM_004476.1  GI:4758397
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2653)
  AUTHORS   Israeli,R.S., Powell,C.T., Fair,W.R. and Heston,W.D.
  TITLE     Molecular cloning of a complementary DNA encoding a
            prostate-specific membrane antigen
  JOURNAL   Cancer Res. 53 (2), 227-230 (1993)
  MEDLINE   93113576
REFERENCE   2  (bases 1 to 2653)
  AUTHORS   Rinker-Schaeffer CW, Hawkins AL, Su SL, Israeli RS, Griffin CA,
            Isaacs JT and Heston WD.
  TITLE     Localization and physical mapping of the prostate-specific membrane
            antigen (PSM) gene to human chromosome 11
  JOURNAL   Genomics 30 (1), 105-108 (1995)
  MEDLINE   96129312
  PUBMED    8595888
REFERENCE   3  (bases 1 to 2653)
  AUTHORS   O'Keefe DS, Su SL, Bacich DJ, Horiguchi Y, Luo Y, Powell CT,
            Zandvliet D, Russell PJ, Molloy PL, Nowak NJ, Shows TB, Mullins C,
            Vonder Haar RA, Fair WR and Heston WD.
  TITLE     Mapping, genomic organization and promoter analysis of the human
            prostate-specific membrane antigen gene
  JOURNAL   Biochim. Biophys. Acta 1443 (1-2), 113-127 (1998)
  MEDLINE   99057583
  PUBMED    9838072
REFERENCE   4  (bases 1 to 2653)
  AUTHORS   Maraj BH, Leek JP, Karayi M, Ali M, Lench NJ and Markham AF.
  TITLE     Detailed genetic mapping around a putative prostate-specific
            membrane antigen locus on human chromosome 11p11.2
  JOURNAL   Cytogenet. Cell Genet. 81 (1), 3-9 (1998)
  MEDLINE   98358137
  PUBMED    9691167
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from M99487.1.

FEATURES             Location/Qualifiers
     source          1..2653
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="11"
                     /map="11p11.2"
                     /sex="male"
                     /cell_line="LNCaP-ATCC"
                     /cell_type="prostate"
                     /tissue_type="prostatic carcinoma metastatic lymph node"
                     /tissue_lib="LNCaP cDNA of Ron Israeli"
     gene            1..2653
                     /gene="FOLH1"
                     /note="FOLH; PSM; PSMA"
                     /db_xref="LocusID:2346"
                     /db_xref="MIM:600934"
     CDS             262..2514
                     /gene="FOLH1"
                     /note="folate hydrolase 1 (prostate-specific membrane
                     antigen)"
                     /codon_start=1
                     /db_xref="LocusID:2346"
                     /db_xref="MIM:600934"
                     /evidence=experimental
                     /product="folate hydrolase (prostate-specific membrane
                     antigen) 1"
                     /protein_id="NP_004467.1"
                     /db_xref="GI:4758398"
                     /translation="MWNLLHETDSAVATARRPRWLCAGALVLAGGFFLLGFLFGWFIK
                     SSNEATNITPKHNMKAFLDELKAENIKKFLYNFTQIPHLAGTEQNFQLAKQIQSQWKE
                     FGLDSVELAHYDVLLSYPNKTHPNYISIINEDGNEIFNTSLFEPPPPGYENVSDIVPP
                     FSAFSPQGMPEGDLVYVNYARTEDFFKLERDMKINCSGKIVIARYGKVFRGNKVKNAQ
                     LAGAKGVILYSDPADYFAPGVKSYPDGWNLPGGGVQRGNILNLNGAGDPLTPGYPANE
                     YAYRRGIAEAVGLPSIPVHPIGYYDAQKLLEKMGGSAPPDSSWRGSLKVPYNVGPGFT
                     GNFSTQKVKMHIHSTNEVTRIYNVIGTLRGAVEPDRYVILGGHRDSWVFGGIDPQSGA
                     AVVHEIVRSFGTLKKEGWRPRRTILFASWDAEEFGLLGSTEWAEENSRLLQERGVAYI
                     NADSSIEGNYTLRVDCTPLMYSLVHNLTKELKSPDEGFEGKSLYESWTKKSPSPEFSG
                     MPRISKLGSGNDFEVFFQRLGIASGRARYTKNWETNKFSGYPLYHSVYETYELVEKFY
                     DPMFKYHLTVAQVRGGMVFELANSIVLPFDCRDYAVVLRKYADKIYSISMKHPQEMKT
                     YSVSFDSLFSAVKNFTEIASKFSERLQDFDKSNPIVLRMMNDQLMFLERAFIDPLGLP
                     DRPFYRHVIYAPSSHNKYAGESFPGIYDALFDIESKVDPSKAWGEVKRQIYVAAFTVQ
                     AAAETLSEVA"
     misc_feature    778..1029
                     /note="PA; Region: PA domain"
BASE COUNT      782 a    524 c    640 g    707 t
ORIGIN

        1 ctcaaaaggg gccggatttc cttctcctgg aggcagatgt tgcctctctc tctcgctcgg
       61 attggttcag tgcactctag aaacactgct gtggtggaga aactggaccc caggtctgga
      121 gcgaattcca gcctgcaggg ctgataagcg aggcattagt gagattgaga gagactttac
      181 cccgccgtgg tggttggagg gcgcgcagta gagcagcagc acaggcgcgg gtcccgggag
      241 gccggctctg ctcgcgccga gatgtggaat ctccttcacg aaaccgactc ggctgtggcc
      301 accgcgcgcc gcccgcgctg gctgtgcgct ggggcgctgg tgctggcggg tggcttcttt
      361 ctcctcggct tcctcttcgg gtggtttata aaatcctcca atgaagctac taacattact
      421 ccaaagcata atatgaaagc atttttggat gaattgaaag ctgagaacat caagaagttc
      481 ttatataatt ttacacagat accacattta gcaggaacag aacaaaactt tcagcttgca
      541 aagcaaattc aatcccagtg gaaagaattt ggcctggatt ctgttgagct a9CaCa
      601 gatgtcctgt tgtcctaccc aaataagact catcccaact acatctcaat aattaatgaa
      661 gatggaaatg agattttcaa cacatcatta tttgaaccac ctcctccagg atatgaaaat
      721 gtttcggata ttgtaccacc tttcagtgct ttctctcctc aaggaatgcc agagggcgat
      781 ctagtgtatg ttaactatgc acgaactgaa gacttcttta aattggaacg ggacatgaaa
      841 atcaattgct ctgggaaaat tgtaattgcc agatatggga aagttttcag aggaaataag
      901 gttaaaaatg cccagctggc aggggccaaa ggagtcattc tctactccga ccctgctgac
      961 tactttgctc ctggggtgaa gtcctatcca gatggttgga atcttcctgg aggtggtgtc
     1021 cagcgtggaa atatcctaaa tctgaatggt gcaggagacc ctctcacacc aggttaccca
     1081 gcaaatgaat atgcttatag gcgtggaatt gcagaggctg ttggtcttcc aagtattcct
     1141 gttcatccaa ttggatacta tgatgcacag aagctcctag aaaaaatggg tggctcagca
     1201 ccaccagata gcagctggag aggaagtctc aaagtgccct acaatgttgg
aCCt9 ?26lactggaaact tttctacaca aaaagtcaag atgcacatcc actctaccaa 

5 t9aa9 U2i a agaatttaca atgtgatagg tactctcaga ggagcagtgg aaccagacag 

atata i381 t ctgggaagtc accgggactc atgggtgttt ggtggtattg accctcagag 
t93a9 l5fl gttgttcatg aaattgtgag gagctttgga acactgaaaa aggaagggtg 

10 9a9aC j5o? a agaacaattt tgtttgcaag ctgggatgca gaagaatttg gtcttcttgg 
ttCta i561 9 tgggcagagg agaattcaag actccttcaa gagcgtggcg tggcttatat 

15 taat9 i621°tcatctatag aaggaaacta cactctgaga gttgattgta caccgctgat 

9taCa flsi 9 gtacacaacc taacaaaaga gctgaaaagc cctgatgaag gctttgaagg 

C 3.3. at c t c t 1 

1741 tatgaaagtt ggactaaaaa aagtccttcc ccagagttca gtggcatgcc 
20 Ca99a J aa f C aaa ttgggat ctggaaatga ttttgaggtg ttcttccaac gacttggaat 

t9Ct ieJl C agagcacggt atactaaaaa ttgggaaaca aacaaattca gcggctatcc 
25 aCt9fc l921 C agtgtctatg aaacatatga gttggtggaa aagttttatg atccaatgtt 

taaata 98i°ctcactgtgg cccaggttcg aggagggatg gtgtttgagc tagccaatto 

Cata9 2041 C ccttttgatt gtcgagatta tgctgtagtt ttaagaaagt atgctgacaa 
aa.tcta.ca.Qt 

2101 atttctatga aacatccaca ggaaatgaag acatacagtg tatcatttga 
ttCaC 2i6i t tctgcagtaa agaatfcttac agaaattgct tccaagttca gtgagagact 

35 CCa99 2221 gacaaaagca acccaatagt attaagaatg atgaatgatc aactcatgtt 
tCt99a 28i a gcatttattg atccattagg gttaccagac aggcctttfct ataggcatgt 
CatCt 23?l t ccaagcagcc acaacaagta tgcaggggag tcattcccag gaatttatga 

40 t9CtC 2^1 gatattgaaa gcaaagtgga cccttccaag gcctggggag aagtgaagag 
a ° a9a 2461 gttgcagcct tcacagtgca ggcagctgca gagactttga gtgaagtagc 

45 Cta8S 252a t tctttagaga atccgtattg aatttgtgtg gtatgtcact cagaaagaat 

C9taa 25fl t atattgataa attttaaaat tggtatattt gaaataaagt tgaatattat 

atataaaaaa 

2641 aaaaaaaaaa aaa 
U20093. Human melanocyte-... [gi:1142634]

LOCUS       HSPMEL17S2   2817 bp    DNA             PRI       20-NOV-1996
DEFINITION  Human melanocyte-specific (pmel 17) gene, exons 2-5, and complete
            cds.
ACCESSION   U20093
VERSION     U20093.1  GI:1142634
KEYWORDS
SEGMENT     2 of 2
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2817)
  AUTHORS   Kim,K.K., Youn,B.S., Heng,H.H., Shi,X.M., Tsui,L.C., Lee,Z.H.,
            Pickard,R.T. and Kwon,B.S.
  TITLE     Genomic organization and FISH mapping of human Pmel 17, the
            putative silver locus
  JOURNAL   Pigment Cell Res. 9 (1), 42-48 (1996)
  MEDLINE   96314705
REFERENCE   2  (bases 1 to 2817)
  AUTHORS   Kwon,B.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-JAN-1995) Indiana University School of Medicine,
            Microbiology and Immunology, 635 Barnhill Drive, Indianapolis, IN
            46202, USA
FEATURES             Location/Qualifiers
     source          1..2817
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="12"
                     /map="12pter-q21"
                     /clone_lib="lambda FIX II from Stratagene"
     gene            join(U19491.1:1111..3109,1..2722)
                     /gene="pmel 17"
     CDS             join(U19491.1:1111..1186,67..459,853..1014,1266..2396,
                     2499..2722)
                     /gene="pmel 17"
                     /codon_start=1
                     /product="pmel 17"
                     /protein_id="AAB19181.1"
                     /db_xref="GI:1142636"
                     /translation="MDLVLKRCLLHLAVIGALLAVGATKVPRNQDWLGVSRQLRTKAWNRQLYPEW
                     TEAQRLDCWRGGQVSLKVSNDGPTLIGANASFSIALNFPGSQKVLPDGQVIWVNNTIINGSQVWGG
                     QPVYPQETDDACIFPDGGPCPSGSWSQKRSFVYVWKTWGQYWQVLGGPVSGLSIGTGRAMLGTHTM
                     EVTVYHRRGSRSYVPLAHSSSAFTITDQVPFSVSVSQLRALDGGNKHFLRNQPLTFALQLHDPSGY
                     LAEADLSYTWDFGDSSGTLISRAPVVTHTYLEPGPVTAQVVLQAAIPLTSCGSSPVPGTTDGHRPT
                     AEAPNTTAGQVPTTEVVGTTPGQAPTAEPSGTTSVQVPTTEVISTAPVQMPTAESTGMTPEKVPVS
                     EVMGTTLAEMSTPEATGMTPAEVSIVVLSGTTAAQVTTTEWVETTARELPIPEPEGPDASSIMSTE
                     SITGSLGPLLDGTATLRLVKRQVPLDCVLYRYGSFSVTLDIVQGIESAEILQAVPSGEGDAFELTV
                     SCQGGLPKEACMEISSPGCQPPAQRLCQPVLPSPACQLVLHQILKGGSGTYCLNVSLADTNSLAVV
                     STQLIMPGQEAGLGQVPLIVGILLVLMAVVLASLIYRRRLMKQDFSVPQLPHSSSHWLRLPRIFCS
                     CPIGENSPLLSGQQV"
     exon            67..459
                     /gene="pmel 17"
                     /number=2
     exon            853..1014
                     /gene="pmel 17"
                     /number=3
     exon            1266..2396
                     /gene="pmel 17"
                     /number=4

     exon            2499..>2722
                     /gene="pmel 17"
                     /number=5
BASE COUNT      627 a    772 c    739 g    679 t
ORIGIN
        1 gtgctaaaaa gatgccttct tcatttggct gtgataggtg ctttgtggct gtgggggcta
       61 caaaagtacc cagaaaccag gactggcttg gtgtctcaag gcaactcaga accaaagcct
      121 ggaacaggca gctgtatcca gagtggacag aagcccagag acttgactgc tggagaggtg
      181 gtcaagtgtc cctcaaggtc agtaatgatg ggcctacact gattggtgca aatgcctcct
      241 tctctattgc cttgaacttc cctggaagcc aaaaggtatt gccagatggg caggttatct
      301 gggtcaacaa taccatcatc aatgggagcc aggtgtgggg aggacagcca gtgtatcccc
      361 aggaaactga cgatgcctgc atcttccctg atggtggacc ttgcccatct ggctcttggt
      421 ctcagaagag aagctttgtt tatgtctgga agacctgggg tgagggactc ccttctcagc
      481 ctatcatcca cacttgtgtt tacttctttc tacctgatca cctttctttt ggccgcccct
      541 tccaccttaa cttctgtgat tttctctaat cttcattttc ctcttagatc ttttctcttt
      601 cttagcacct agcccccttc aagctctatc ataattcttt ctggcaactc ttggcctcaa
      661 ttgtagtcct accccatgga atgcctcatt aggacccctt ccctgtcccc ccatatcaca
      721 gccttccaaa caccctcaga agtaatcata cttcctgacc tcccatctcc agfcgccgttt
      781 cgaagcctgt ccctcagtcc cctttgacca gtaatctctt cttccttgct tttcattcca
      841 aaaatgcttc aggccaatac tggcaagttc tagggggccc agtgtctggg ctgagcattg
      901 ggacaggcag ggcaatgctg ggcacacaca ccatggaagt gactgtctac catcgccggg
      961 gatcccggag ctatgtgcct cttgctcatt ccagctcagc cttcaccatt actggtaagg
     1021 gttcaggaag ggcaaggcca gttgtagggc aaagagaagg cagggaggct tggatggact
     1081 gcaaaggaga aaggtgaaat gctgtgcaaa cttaaagtag aagggccagg aagacctagg
     1141 cagagaaatg tgaggcttag tgccagtgaa gggccagcca gtcagcttgg agttggaggg
     1201 tgtggctgtg aaaggagaag ctgtggctca ggcctggttc tcaccttttc tggctccaat
     1261 cccagaccag gtgcctttct ccgtgagcgt gtcccagttg cgggccttgg atggagggaa
     1321 caagcacttc ctgagaaatc agcctctgac ctttgccctc cagctccatg accccagtgg
     1381 ctatctggct gaagctgacc tctcctacac ctgggacttt ggagacagta gtggaaccct
     1441 gatctctcgg gcacctgtgg tcactcatac ttacctggag cctggcccag tcactgccca
     1501 ggtggtcctg caggctgcca ttcctctcac ctcctgtggc tcctccccag ttccaggcac
     1561 cacagatggg cacaggccaa ctgcagaggc ccctaacacc acagctggcc aagtgcctac
     1621 tacagaagtt gtgggtacta cacctggtca ggcgccaact gcagagccct ctggaaccac
     1681 atctgtgcag gtgccaacca ctgaagtcat aagcactgca cctgtgcaga tgccaactgc
     1741 agagagcaca ggtatgacac ctgagaaggt gccagtttca gaggtcatgg gtaccacact
     1801 ggcagagatg tcaactccag aggctacagg tatgacacct gcagaggtat caattgtggt
     1861 gctttctgga accacagctg cacaggtaac aactacagag tgggtggaga ccacagctag
     1921 agagctacct atccctgagc ctgaaggtcc agatgccagc tcaatcatgt ctacggaaag
     1981 tattacaggt tccctgggcc ccctgctgga tggtacagcc accttaaggc tggtgaagag
     2041 acaagtcccc ctggattgtg ttctgtatcg atatggttcc ttttccgtca ccctggacat
     2101 tgtccagggt attgaaagtg ccgagatcct gcaggctgtg ccgtccggtg agggggatgc
     2161 atttgagctg actgtgtcct gccaaggcgg gctgcccaag gaagcctgca tggagatctc
     2221 atcgccaggg tgccagcccc ctgcccagcg gctgtgccag cctgtgctac ccagcccagc
     2281 ctgccagctg gttctgcacc agatactgaa gggtggctcg gggacatact gcctcaatgt
     2341 gtctctggct gataccaaca gcctggcagt ggtcagcacc cagcttatca tgcctggtag
     2401 gtccttggac agagactaag tgaggaggga agtggataga ggggacagct ggcaagcagc
     2461 agacatgagt gaagcagtgc ctgggattct tctcacaggt caagaagcag gccttgggca
     2521 ggttccgctg atcgtgggca tcttgctggt gttgatggct gcggticcctg catctctgat
     2581 atataggcgc agacttatga agcaagactt ctccgtaccc cagttgccac atagcagcag
     2641 tcactggctg cgtctacccc gcatcttctg ctcttgtccc attggtgaga atagccccct
     2701 cctcagtggg cagcaggtct gagtactctc atatgatgct gtgattttcc tggagttgac
     2761 agaaacacct atatttcccc cagtcttccc tgggagacta ctattaactg aaataaa
//
LOCUS       NM_001648    1466 bp    mRNA            PRI       31-OCT-2000
DEFINITION  Homo sapiens kallikrein 3, (prostate specific antigen) (KLK3),
            mRNA.
ACCESSION   NM_001648
VERSION     NM_001648.1  GI:4502172
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1466)
  AUTHORS   Lundwall,A. and Lilja,H.
  TITLE     Molecular cloning of human prostate specific antigen cDNA
  JOURNAL   FEBS Lett. 214 (2), 317-322 (1987)
  MEDLINE   87190978
REFERENCE   2  (bases 1 to 1466)
  AUTHORS   Sutherland GR, Baker E, Hyland VJ, Callen DF, Close JA, Tregear GW,
            Evans BA and Richards RI.
  TITLE     Human prostate-specific antigen (APS) is a member of the glandular
            kallikrein gene family at 19q13
  JOURNAL   Cytogenet. Cell Genet. 48 (4), 205-207 (1988)
  MEDLINE   89250658
  PUBMED    2470553
REFERENCE   3  (bases 1 to 1466)
  AUTHORS   Riegman PH, Klaassen P, van der Korput JA, Romijn JC and Trapman J.
  TITLE     Molecular cloning and characterization of novel prostate antigen
            cDNA's
  JOURNAL   Biochem. Biophys. Res. Commun. 155 (1), 181-188 (1988)
  MEDLINE   88326297
  PUBMED    2458104
REFERENCE   4  (bases 1 to 1466)
  AUTHORS   Schulz P, Stucka R, Feldmann H, Combriato G, Klobeck HG and
            Fittler F.
  TITLE     Sequence of a cDNA clone encompassing the complete mature human
            prostate specific antigen (PSA) and an unspliced leader sequence
  JOURNAL   Nucleic Acids Res. 16 (13), 6226 (1988)
  MEDLINE   88289366
  PUBMED    2456523
REFERENCE   5  (bases 1 to 1466)
  AUTHORS   Riegman PH, Vlietstra RJ, van der Korput JA, Romijn JC and
            Trapman J.
  TITLE     Characterization of the prostate-specific antigen gene: a novel
            human kallikrein-like gene
  JOURNAL   Biochem. Biophys. Res. Commun. 159 (1), 95-102 (1989)
  MEDLINE   89165891
  PUBMED    2466464
REFERENCE   6  (bases 1 to 1466)
  AUTHORS   Henttu P and Vihko P.
  TITLE     cDNA coding for the entire human prostate specific antigen shows
            high homologies to the human tissue kallikrein genes
  JOURNAL   Biochem. Biophys. Res. Commun. 160 (2), 903-910 (1989)
  MEDLINE   89246551
  PUBMED    2470373
REFERENCE   7  (bases 1 to 1466)
  AUTHORS   Klobeck HG, Combriato G, Schulz P, Arbusow V and Fittler F.
  TITLE     Genomic sequence of human prostate specific antigen (PSA)
  JOURNAL   Nucleic Acids Res. 17 (10), 3981 (1989)
  MEDLINE   89282407
  PUBMED    2471958
REFERENCE   8  (bases 1 to 1466)
  AUTHORS   Lundwall A.
  TITLE     Characterization of the gene for prostate-specific antigen, a human
            glandular kallikrein
  JOURNAL   Biochem. Biophys. Res. Commun. 161 (3), 1151-1159 (1989)
  MEDLINE   89302090
  PUBMED    2472789
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from X05332.1.

FEATURES             Location/Qualifiers
     source          1..1466
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q13"
                     /clone="lambda HPSA-1"
                     /tissue_type="prostate"
                     /clone_lib="(lambda)gt11"
     gene            1..1466
                     /gene="KLK3"
                     /note="PSA; APS"
                     /db_xref="LocusID:354"
                     /db_xref="MIM:176820"
     sig_peptide     44..94
     CDS             44..829
                     /gene="KLK3"
                     /EC_number="3.4.21.77"
                     /note="antigen, prostate specific"
                     /codon_start=1
                     /db_xref="LocusID:354"
                     /db_xref="MIM:176820"
                     /product="kallikrein 3, (prostate specific antigen)"
                     /protein_id="NP_001639.1"
                     /db_xref="GI:4502173"
                     /translation="MWVPVVFLTLSVTWIGAAPLILSRIVGGWECEKHSQPWQVLVAS
                     RGRAVCGGVLVHPQWVLTAAHCIRNKSVILLGRHSLFHPEDTGQVFQVSHSFPHPLYDMSLLKNRF
                     LRPGDDSSHDLMLLRLSEPAELTDAVKVMDLPTQEPALGTTCYASGWGSIEPEEFLTPKKLQCVDL
                     HVISNDVCAQVHPQKVTKFMLCAGRWTGGKSTCSGDSGGPLVCNGVLQGITSWGSEPCALPERPSL
                     YTKVVHYRKWIKDTIVANP"
     variation       91
                     /allele="C"
                     /allele="T"
                     /db_xref="dbSNP:1135765"
     variation       91
                     /allele="C"
                     /allele="T"
                     /db_xref="dbSNP:11573"
     misc_feature    95..115
                     /note="propeptide (AA 1-7)"
     variation       97
                     /allele="G"
                     /allele="A"
                     /db_xref="dbSNP:1135766"
     misc_feature    113..802
                     /note="Tryp_SPc; Region: Trypsin-like serine protease"
     misc_feature    116..802
                     /note="trypsin; Region: Trypsin"
     mat_peptide     116..826
                     /product="kallikrein 3, (prostate specific antigen)"
     variation       280
                     /allele="C"
                     /allele="T"
                     /db_xref="dbSNP:1058072"
     variation       280
                     /allele="C"
                     /allele="T"
                     /db_xref="dbSNP:12946"
     variation       526
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:1803130"
     variation       643
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:1803129"
     variation       788
                     /note="WARNING: map location ambiguous"
                     /allele="T"
                     /allele="C"
                     /db_xref="dbSNP:1802720"
     variation       844
                     /allele="T"
                     /allele="C"
                     /db_xref="dbSNP:1058205"
     variation       894
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:1058274"
     variation       906
                     /allele="T"
                     /allele="A"
                     /db_xref="dbSNP:1803132"
     variation       918
                     /allele="T"
                     /allele="C"
                     /db_xref="dbSNP:1802722"
     variation       953
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:1802721"
     variation       1017
                     /allele="T"
                     /allele="C"
                     /db_xref="dbSNP:1803138"
     variation       1074
                     /allele="C"
                     /allele="T"
                     /db_xref="dbSNP:12101"
     variation       1107
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:1803131"
     variation       1107
                     /allele="G"
                     /allele="A"
                     /db_xref="dbSNP:6998"
     variation       1211
                     /allele="C"
                     /allele="T"
                     /db_xref="dbSNP:1803136"
     variation       1229
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:1803135"
     variation       1249
                     /allele="T"
                     /allele="C"
                     /db_xref="dbSNP:1803134"
     variation       1273
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:12040"
     variation       1326
                     /allele="T"
                     /allele="C"
                     /db_xref="dbSNP:1803133"
     variation       1344
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:12863"
     variation       1363
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:1803137"
     misc_feature    1445..1450
                     /note="put. polyA signal"
     polyA_site      1466
                     /note="polyA site"
BASE COUNT 338 a 382 c 422 g 324 t 

ORIGIN 

        1 agccccaagc ttaccacctg cacccggaga gctgtgtgtc accatgtggg tcccggttgt
       61 cttcctcacc ctgtccgtga cgtggattgg tgctgcaccc ctcatcctgt ctcggattgt
      121 gggaggctgg gagtgcgaga agcattccca accctggcag gtgcttgtgg cctctcgtgg
      181 cagggcagtc tgcggcggtg ttctggtgca cccccagtgg gtcctcacag ctgcccactg
      241 catcaggaac aaaagcgtga tcttgctggg tcggcacagc ctgtttcatc ctgaagacac
      301 aggccaggta tttcaggtca gccacagctt cccacacccg ctctacgata tgagcctcct
      361 gaagaatcga ttcctcaggc caggtgatga ctccagccac gacctcatgc tgctccgcct
      421 gtcagagcct gccgagctca cggatgctgt gaaggtcatg gacctgccca cccaggagcc
      481 agcactgggg accacctgct acgcctcagg ctggggcagc attgaaccag aggagttctt
      541 gaccccaaag aaacttcagt gtgtggacct ccatgttatt tccaatgacg tgtgtgcgca
      601 agttcaccct cagaaggtga ccaagttcat gctgtgtgct ggacgctgga cagggggcaa
      661 aagcacctgc tcgggtgatt ctgggggccc acttgtctgt aatggtgtgc ttcaaggtat
      721 cacgtcatgg ggcagtgaac catgtgccct gcccgaaagg ccttccctgt acaccaaggt
      781 ggtgcattac cggaagtgga tcaaggacac catcgtggcc aacccctgag cacccctatc
      841 aaccccctat tgtagtaaac ttggaacctt ggaaatgacc aggccaagac tcaagcctcc
      901 ccagttctac tgacctttgt ccttaggtgt gaggtccagg gttgctagga aaagaaatca
      961 gcagacacag gtgtagacca gagtgtttct taaatggtgt aattttgtcc tctctgtgtc
     1021 ctggggaata ctggccatgc ctggagacat atcactcaat ttctctgagg acacagatag
     1081 gatggggtgt ctgtgttatt tgtggggtac agagatgaaa gaggggtggg atccacactg
     1141 agagagtgga gagtgacatg tgctggacac tgtccatgaa gcactgagca gaagctggag
     1201 gcacaacgca ccagacactc acagcaagga tggagctgaa aacataaccc actctgtcct
     1261 ggaggcactg ggaagcctag agaaggctgt gagccaagga gggagggtct tcctttggca
     1321 tgggatgggg atgaagtaag gagagggact ggaccccctg gaagctgatt cactatgggg
     1381 ggaggtgtat tgaagtcctc cagacaaccc tcagatttga tgatttccta gtagaactca
     1441 cagaaataaa gagctgttat actgtg
//
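
A reader who wishes to cross-check a BASE COUNT line against the ORIGIN block of a record such as the one above may do so mechanically. The following is a minimal sketch in Python, offered only as an illustration and not as part of the deposited record; the function name `count_bases` is hypothetical, and the sample input is only the first ORIGIN line of the KLK3 record, so the counts it yields are for that line alone.

```python
from collections import Counter


def count_bases(origin_text: str) -> Counter:
    """Count nucleotide letters in GenBank-style ORIGIN lines.

    Only lines that begin with a position number are read, so the
    'ORIGIN' keyword and the terminating '//' are skipped.
    """
    counts = Counter()
    for line in origin_text.splitlines():
        parts = line.split()
        if not parts or not parts[0].isdigit():
            continue
        # parts[0] is the position number; the rest are sequence blocks.
        for block in parts[1:]:
            counts.update(ch for ch in block.lower() if ch in "acgtn")
    return counts


# Sample input: the first ORIGIN line of the record above (six blocks of ten).
sample = "1 agccccaagc ttaccacctg cacccggaga gctgtgtgtc accatgtggg tcccggttgt"
counts = count_bases(sample)
total = sum(counts.values())
```

Applied to a complete ORIGIN block, the per-letter totals can be compared with the a, c, g and t figures on the BASE COUNT line; a mismatch suggests a transcription error in the sequence text.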
U87459. Human autoimmunog. [gi:1890098]

LOCUS       HSU87459      752 bp    mRNA            PRI       22-DEC-1999
DEFINITION  Human autoimmunogenic cancer/testis antigen NY-ESO-1 mRNA, complete
            cds.
ACCESSION   U87459
VERSION     U87459.1  GI:1890098
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 752)
  AUTHORS   Chen,Y.T., Scanlan,M.J., Sahin,U., Tureci,O., Gure,A.O., Tsang,S.,
            Williamson,B., Stockert,E., Pfreundschuh,M. and Old,L.J.
  TITLE     A testicular antigen aberrantly expressed in human cancers detected
            by autologous antibody screening
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 94 (5), 1914-1918 (1997)
  MEDLINE   97203161
  PUBMED    9050879
REFERENCE   2  (bases 1 to 752)
  AUTHORS   Chen,Y.-T.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-JAN-1997) Ludwig Institute for Cancer Research, New
            York Branch, 1275 York Avenue, New York, NY 10021, USA
FEATURES             Location/Qualifiers
     source          1..752
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone_lib="esophageal squamous cell carcinoma expression
                     cDNA library"
     CDS             54..596
                     /codon_start=1
                     /product="autoimmunogenic cancer/testis antigen NY-ESO-1"
                     /protein_id="AAB49693.1"
                     /db_xref="GI:1890099"
                     /translation="MQAEGRGTGGSTGDADGPGGPGIPDGPGGNAGGPGEAG
                     ATGGRGPRGAGAARASGPGGGAPRGPHGGAASGLNGCCRCGARGPESRLLEF
                     YLAMPFATPMEAELARRSLAQDAPPLPVPGVLLKEFTVSGNILTIRLTAADH
                     RQLQLSISSCLQQLSLLMWITQCFLPVFLAQPPSGQRR"
BASE COUNT      126 a    230 c    256 g    140 t

ORIGIN 

        1 atcctcgtgg gccctgacct tctctctgag agccgggcag aggctccgga gccatgcagg
       61 ccgaaggccg gggcacaggg ggttcgacgg gcgatgctga tggcccagga ggccctggca
      121 ttcctgatgg cccagggggc aatgctggcg gcccaggaga ggcgggtgcc acgggcggca
      181 gaggtccccg gggcgcaggg gcagcaaggg cctcggggcc gggaggaggc gccccgcggg
      241 gtccgcatgg cggcgcggct tcagggcnga acggacgccg caga t y eg y g gccagggggc
      301 cggagagccg cctgcttgag ccCuaccccg CCatgCCttc. eg cy civjciouu atggaagcag
      361 agctggcccg caggagcctg gcccaggatg CCCCaCCgtL tec eg eyece*. ggggtgcttc
      421 tgaaggagtt cactgtgtcc ggcaacacac CgaCtaCCCy gcagaccacc
      481 gccaactgca gctctccatc agctcctgtc tccagcagct ttccctgttg atgtggatca
      541 cgcagtgctt tctgcccgtg tttttggctc agcctccctc agggcagagg cgctaagccc
      601 agcctggcgc cccttcctag gtcatgcctc ctcccctagg gaatggtccc agcacgagtg
      661 gccagttcat tgtgggggcc tgattgtttg tcgctggagg aggacggctt acatgtttgt
      721 ttctgtagaa aataaaactg agctacgaaa aa
//
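
The CDS locations quoted in the records above can be checked for internal consistency, since a complete coding region should span a whole number of codons. The following minimal sketch in Python illustrates one way to do this; the helper name `location_length` is hypothetical, and the sketch assumes locations made of `start..end` spans only (optionally inside `join(...)` and optionally prefixed by an accession such as `U19491.1:`). Single-base locations such as `91` are not handled.

```python
import re


def location_length(location: str) -> int:
    """Total number of bases covered by a simple or join() location.

    Each 'start..end' span contributes end - start + 1 bases; partial
    markers ('<', '>') and accession prefixes are ignored.
    """
    spans = re.findall(r"[<>]?(\d+)\.\.[<>]?(\d+)", location)
    return sum(int(end) - int(start) + 1 for start, end in spans)


# CDS of the NY-ESO-1 record above.
cds_len = location_length("54..596")
codons = cds_len // 3  # includes the stop codon

# Multi-exon CDS of the pmel 17 record, as given in its FEATURES table.
joined = location_length(
    "join(U19491.1:1111..1186,67..459,853..1014,1266..2396,2499..2722)"
)
```

A length that is not a multiple of three would indicate that a coordinate was mis-transcribed or that the coding region is partial.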



CAA11116. LAGE-1a protein [... [gi:3255959]

LOCUS       CAA11116      180 aa                    PRI       23-JUN-1998
DEFINITION  LAGE-1a protein [Homo sapiens].
ACCESSION   CAA11116
PID         g3255959
VERSION     CAA11116.1  GI:3255959
DBSOURCE    embl locus KOS223093, accession AJ223093.1
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (residues 1 to 180)
  AUTHORS   Lethe,B., Lucas,S., Michaux,L., De Smet,C., Godelaine,D.,
            Serrano,A., De Plaen,E. and Boon,T.
  TITLE     LAGE-1, a new gene with tumor specificity
  JOURNAL   Int. J. Cancer 76 (6), 903-908 (1998)
  MEDLINE   98289662
REFERENCE   2  (residues 1 to 180)
  AUTHORS   Lethe,B.G.
  TITLE     Direct Submission
  JOURNAL   Submitted (08-JAN-1998) Lethe B.G., Brussels Branch, Ludwig
            Institute for Cancer Research, 74 Avenue Hippocrate, B - 1200 -
            Bruxelles, BELGIUM
COMMENT     Related sequences: AJ223040, AJ223041 and AJ003149.
FEATURES             Location/Qualifiers
     source          1..180
                     /organism="Homo sapiens"
                     /isolate="individual LB33"
                     /db_xref="taxon:9606"
                     /chromosome="X"
                     /map="q28"
                     /clone="c2RB"
                     /sex="Female"
                     /cell_line="LB33-MELA"
                     /cell_type="melanoma"
                     /clone_lib="B6"
     Protein         1..180
                     /product="LAGE-1a protein"
     CDS             1..180
                     /gene="LAGE-1"
                     /db_xref="SPTREMBL:O75637"
                     /coded_by="join(AJ223093.1:1231..1499,
                     AJ223093.1:2115..2249,AJ223093.1:2479..2617)"
ORIGIN
        1 mqaegrgtgg stgdadgpgg pgipdgpggn aggpgeagat ggrgprgaga arasgprgga
       61 prgphggaas aqdgrcpcga rrpdsrllel hitmpfsspm eaelvrrils rdaaplprpg
      121 avlkdftvsg nllfirltaa dhrqlqleis sclqqlsllm witqcflpvf laqapsgqrr
//

CAA11117. LAGE-1b protein [... [gi:3255960]

LOCUS CAA11117 210 aa PRI 23-JUN-1998
DEFINITION LAGE-1b protein [Homo sapiens].
ACCESSION CAA11117
PID g3255960
VERSION CAA11117.1 GI:3255960
DBSOURCE embl locus HOS223093, accession AJ223093.1
KEYWORDS
SOURCE human.
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (residues 1 to 210)
AUTHORS Lethe,B., Lucas,S., Michaux,L., De Smet,C., Godelaine,D.,
Serrano,A., De Plaen,E. and Boon,T.
TITLE LAGE-1, a new gene with tumor specificity
JOURNAL Int. J. Cancer 76 (6), 903-908 (1998)
MEDLINE 98289662
REFERENCE 2 (residues 1 to 210)
AUTHORS Lethe,B.G.
TITLE Direct Submission
JOURNAL Submitted (08-JAN-1998) Lethe B.G., Brussels Branch, Ludwig
Institute for Cancer Research, 74 Avenue Hippocrate, B - 1200 -
Bruxelles, BELGIUM
COMMENT Related sequences: AJ223040, AJ223041 and AJ003149
FEATURES Location/Qualifiers
source 1..210
/organism="Homo sapiens"
/isolate="individual LB33"
/db_xref="taxon:9606"
/chromosome="X"
/map="q28"
/clone="c2RB"
/sex="Female"
/cell_line="LB33-MELA"
/cell_type="melanoma"
/clone_lib="B6"
Protein 1..210
/product="LAGE-1b protein"
CDS 1..210
/gene="LAGE-1"
/db_xref="SPTREMBL:O75638"
/coded_by="join(AJ223093.1:1231..1499,
AJ223093.1:2115..2478)"
/note="alternate splicing at exon 2 - intron 2 boundary"
ORIGIN
1 mqaegrgtgg stgdadgpgg pgipdgpggn aggpgeagat ggrgprgaga arasgprgga
61 prgphggaas aqdgrcpcga rrpdsrllel hitmpfsspm eaelvrrils rdaaplprpg
121 avlkdftvsg nllfmsvwdq dregagrmrv vgwglgsasp egqkardlrt pkhkvseqrp
181 gtpgppppeg aqgdgcrgva fnvmfsaphi
//



M77481. Human antigen (MA... [gi:416114]

LOCUS HUMMAG1A 2420 bp DNA PRI 15-NOV-1993
DEFINITION Human antigen (MAGE-1) gene, complete cds.
ACCESSION M77481
VERSION M77481.1 GI:416114
KEYWORDS antigen.
SOURCE Homo sapiens (individual_isolate patient MZ2) melanoma abdominal
metastasis of melanoma DNA.
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 785 to 1286)
AUTHORS van der Bruggen,P., Traversari,C., Chomez,P., Lurquin,C.,
De Plaen,E., Van den Eynde,B., Knuth,A. and Boon,T.
TITLE A gene encoding an antigen recognized by cytolytic T lymphocytes on
a human melanoma
JOURNAL Science 254, 1643-1647 (1991)
MEDLINE 92086861
REFERENCE 2 (bases 1 to 2420)
AUTHORS van der Bruggen P.
TITLE Direct Submission
JOURNAL Submitted (05-FEB-1992) Pierre van der Bruggen, Ludwig Institute
for Cancer Research, Brussels Branch, Avenue Hippocrate, 74, UCL
7459, Brussels, B-1200, Belgium
COMMENT On Nov 15, 1993 this sequence version replaced gi:187294.
FEATURES Location/Qualifiers
source 1..2420
/organism="Homo sapiens"
/db_xref="taxon:9606"
/map="X"
/sex="female"
/cell_line="MZ2-MEL.43"
intron 1..412
/partial
/gene="MAGE-1"
/number=1
exon 413..485
/gene="MAGE-1"
/number=2
mRNA join(413..485,561..2111)
/partial
/gene="MAGE-1"
intron 486..560
/gene="MAGE-1"
/number=2
exon 561..2111
/gene="MAGE-1"
/number=3
gene 626..1555
/gene="MAGE-1"
CDS 626..1555
/gene="MAGE-1"
/codon_start=1
/product="MAGE-1"
/protein_id="AAA03229.1"
/db_xref="GI:416115"
/translation="MSLEQRSLHCKPEEALEAQQEALGLVCVQAATSSSSPLVLGTLEEVPTAG
STDPPQSPQGASAFPTTINFTRQRQPSEGSSSREEEGPSTSCILESLFRAVITKKVADLV
GFLLLKYRAREPVTKAEMLESVIKNYKHCFPEIFGKASESLQLVFGIDVKEADPTGHSYV
LVTCLGLSYDGLLGDNQIMPKTGFLIIVLVMIAMEGGHAPEEEIWEELSVMEVYDGREHS
AYGEPRKLLTQDLVQEKYLEYRQVPDSDPARYEFLWGPRALAETSYVKVLEYVIKVSARV
RFFFPSLREAALREEEEGV"
polyA_signal 2095..2100
/gene="MAGE-1"
intron 2112..2420
/partial
/gene="MAGE-1"
/number=3
BASE COUNT 562 a 582 c 677 g 599 t

ORIGIN 

1 ggatccaggc cctgccagga aaaatataag ggccctgcgt gagaacagag ggggtcatcc
61 actgcatgag agtggggatg tcacagagtc cagcccaccc tcctggtagc actgagaagc
121 cagggctgtg cttgcggtct gcaccctgag ggcccgtgga ttcctcttcc tggagctcca
181 ggaaccaggc agtgaggcct tggtctgaga cagtatcctc aggtcacaga gcagaggatg
241 cacagggtgt gccagcagtg aatgtttgcc ctgaatgcac accaagggcc ccacctgcca
301 caggacacat aggactccac agagtctggc ctcacctccc tactgtcagt cctgtagaat
361 cgacctctgc tggccggctg taccctgagt accctctcac ttcctccttc aggttttcag
421 gggacaggcc aacccagagg acaggattcc ctggaggcca cagaggagca ccaaggagaa
481 gatctgtaag taggcctttg ttagagtctc caaggttcag ttctcagctg aggcctctca
541 cacactccct ctctccccag gcctgtgggt cttcattgcc cagctcctgc ccacactcct
601 gcctgctgcc ctgacgagag tcatcatgtc tcttgagcag aggagtctgc actgcaagcc
661 tgaggaagcc cttgaggccc aacaagaggc cctgggcctg gtgtgtgtgc aggctgccac
721 ctcctcctcc tctcctctgg tcctgggcac cctggaggag gtgcccactg ctgggtcaac
781 agatcctccc cagagtcctc agggagcctc cgcctttccc actaccatca acttcactcg
841 acagaggcaa cccagtgagg gttccagcag ccgtgaagag gaggggccaa gcacctcttg
901 tatcctggag tccttgttcc gagcagtaat cactaagaag gtggctgatt tggttggttt
961 tctgctcctc aaatatcgag ccagggagcc agtcacaaag gcagaaatgc tggagagtgt
1021 catcaaaaat tacaagcact gttttcctga gatcttcggc aaagcctctg agtccttgca
1081 gctggtcttt ggcattgacg tgaaggaagc agaccccacc ggccactcct atgtccttgt
1141 cacctgccta ggtctctcct atgatggcct gctgggtgat aatcagatca tgcccaagac
1201 aggcttcctg ataattgtcc tggtcatgat tgcaatggag ggcggccatg ctcctgagga
1261 ggaaatctgg gaggagctga gtgtgatgga ggtgtatgat gggagggagc acagtgccta
1321 tggggagccc aggaagctgc tcacccaaga tttggtgcag gaaaagtacc tggagtaccg
1381 gcaggtgccg gacagtgatc ccgcacgcta tgagttcctg tggggtccaa gggccctcgc
1441 tgaaaccagc tatgtgaaag tccttgagta tgtgatcaag gtcagtgcaa gagttcgctt
1501 tttcttccca tccctgcgtg aagcagcttt gagagaggag gaagagggag tctgagcatg
1561 agttgcagcc aaggccagtg ggagggggac tgggccagtg caccttccag ggccgcgtcc
1621 agcagcttcc cctgcctcgt gtgacatgag gcccattctt cactctgaag agagcggtca
1681 gtgttctcag tagtaggttt ctgttctatt gggtgacttg gagatttatc tttgttctct
1741 tttggaattg ttcaaatgtt tttttttaag ggatggttga atgaacttca gcatccaagt
1801 ttatgaatga cagcagtcac acagttctgt gtatatagtt taagggtaag agtcttgtgt
1861 tttattcaga ttgggaaatc cattctattt tgtgaattgg gataataaca gcagtggaat
1921 aagtacttag aaatgtgaaa aatgagcagt aaaatagatg agataaagaa ctaaagaaat
1981 taagagatag tcaattcttg ccttatacct cagtctattc tgtaaaattt ttaaagatat
2041 atgcatacct ggatttcctt ggcttctttg agaatgtaag agaaattaaa tctgaataaa
2101 gaattcttcc tgttcactgg ctcttttctt ctccatgcac tgagcatctg ctttttggaa
2161 ggccctgggt tagtagtgga gatgctaagg taagccagac tcatacccac ccatagggtc
2221 gtagagtcta ggagctgcag tcacgtaatc gaggtggcaa gatgtcctct aaagatgtag
2281 ggaaaagtga gagaggggtg agggtgtggg gctccgggtg agagtggtgg agtgtcaatg
2341 ccctgagctg gggcattttg ggctttggga aactgcagtt ccttctgggg gagctgattg
2401 taatgatctt gggtggatcc

// 



aatgagcagt 
ccttatacct 
ggcttctttg 
ctcttttctt 
gatgctaagg 
tcacgtaatc 
agggtgtggg 
ggctttggga 



aaaatagatg 
cagtctattc 
agaatgtaag 
ctccatgcac 
taagccagac 
gaggtggcaa 
gctccgggtg 
aactgcagtt 



agataaagaa 
tgtaaaattt 
agaaattaaa 
tgagcatctg 
tcatacccac 
gatgtcctct 
agagtggtgg 
ccttctgggg 



L18920. Human MAGE-2 gene. [gi:436180]

LOCUS HUMMAGE2X 4559 bp DNA PRI 20-APR-1994
DEFINITION Human MAGE-2 gene exons 1-4, complete cds.
ACCESSION L18920
VERSION L18920.1 GI:436180
KEYWORDS
SOURCE Homo sapiens (human).
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4559)
AUTHORS De Smet,C., Lurquin,C., van der Bruggen,P., De Plaen,E.,
Brasseur,F. and Boon,T.
TITLE Sequence and expression pattern of the human MAGE2 gene
JOURNAL Immunogenetics 39 (2), 121-129 (1994)
MEDLINE 94102805
FEATURES Location/Qualifiers
source 1..4559
/organism="Homo sapiens"
/isolate="MZ-2"
/db_xref="taxon:9606"
/map="X"
/haplotype="A1 A29 B37 B44 C6"
/sex="female"
/cell_line="MZ2-MEL.43"
/cell_type="melanoma cell line"
gene 561..4496
/gene="MAGE-2"
exon 561..754
/gene="MAGE-2"
/number=1
intron 755..1850
/gene="MAGE-2"
/number=1
exon 1851..1969
/gene="MAGE-2"
/number=2
intron 1970..2788
/gene="MAGE-2"
/number=2
exon 2789..2854
/gene="MAGE-2"
/number=3
intron 2855..2934
/gene="MAGE-2"
/number=3
exon 2935..4496
/gene="MAGE-2"
/number=4
CDS 3000..3944
/gene="MAGE-2"
/codon_start=1
/protein_id="AAA17729.1"
/db_xref="GI:436181"
/translation="
MPLEQRSQHCKPEEGLEARGEALGLVGAQAPATEEQQTASSSSTLVEVTLGEVPAADSPS
PPHSPQGASSFSTTINYTLWRQSDEGSSNQEEEGPRMFPDLESEFQAAISRKMVELVHFL
LLKYRAREPVTKAEMLESVLRNCQDFFPVIFSKASEYLQLVFGIEVVEVVPISHLYILVT
CLGLSYDGLLGDNQVMPKTGLLIIVLAIIAIEGDCAPEEKIWEELSMLEVFEGREDSVFA
HPRKLLMQDLVQENYLEYRQVPGSDPACYEFLWGPRALIETSYVKVLHHTLKIGGEPHIS
YPPLHERALREGEE"
polyA_signal 4479..4484
/gene="MAGE-2"
BASE COUNT 1049 a 1280 c 1283 g 947 t
ORIGIN
1 attccttcat caaacagcca ggagtgagga agaggaccct cctgagtgag gactgaggat
61 ccaccctcac cacatagtgg gaccacagaa tccagctcag cccctcttgt cagccctggt
121 acacactggc aatgatctca ccccgagcac acccctcccc ccaatgccac ttcgggccga
181 ctcagagtca gagacttggt ctgaggggag cagacacaat cggcagagga tggcggtcca
241 ggctcagtct ggcatccaag tcaggacctt gagggatgac caaaggcccc tcccaccccc
301 aactcccccg accccaccag gatctacagc ctcaggatcc ccgtcccaat ccctacccct
361 acaccaacac catcttcatg cttaccccca cccccccatc cagatcccca tccgggcaga
421 atccggttcc acccttgccg tgaacccagg gaagtcacgg gcccggatgt gacgccactg
481 acttgcacat tggaggtcag aggacagcga gattctcgcc ctgagcaacg gcctgacgtc
541 ggcggaggga agcaggcgca ggctccgtga ggaggcaagg taagacgccg agggaggact
601 gaggcgggcc tcaccccaga cagagggccc ccaataatcc agcgctgcct ctgctgccgg
661 gcctggacca ccctgcaggg gaagacttct caggctcagt cgccaccacc tcaccccgcc
721 accccccgcc gctttaaccg cagggaactc tggcgtaaga gctttgtgtg accagggcag
781 ggctggttag aagtgctcag ggcccagact cagccaggaa tcaaggtcag gaccccaaga
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841 ggggactgag ggcaacccac 
aacaccaacc 

901 ccacccccat ccctcaaaca 
tcctccccca 
5 961 ccaccatcct ggcagaatcc 

ctccgggaat 

1021 ggcggccaag cacgcggatc 

aaggggttgg 

1081 gtctcgtgag tatggccttt 

10 gaagacagtg 

1141 gagtccttag gggacccagc 

gtctcaaact 

1201 gagccacctt ttcattcagc 
cagcaggggg 
15 1261 ttggggccca gcctgcgagg 

aggggacctt 

1321 ggagtccaga tcagtggcaa 
ccgaatgtgc 

1381 cccgtgctca ttgcaccttc 

20 gagggcbggg 

1441 acttcaggtc agcagaggga 

tgcccccttc 

1501 atgaggactg gggatacccc 
tggaagtccc 
25 1561 ttgttcttag ctctggggga 

ctcatttgta 

1621 ccacaggcag gaggttgggg 

agaggagctg 

1681 tctgctcatt tcagggggtt 

30 agtaaagatg 

1741 agtaacccac aggaggccat 
agccctggac 

1801 aacgcacgtg ggggtaacag 

atctcaggga 
35 1861 gttgatgacc ttgttttcag 

tctggtcgac 

1921 agatgcagtg gttctaggat 

taggattgag 

1981 ggtacccctg ggccagaatg 

40 gcccctgcgg 

2041 ttacttcaga gaccctgggc 

ctgggatctt 

2101 tgatgtcagg gaaggggagg 
gtagagggag 
45 2161 ggtctcaggc cctgccagga 

ccaggacacc 

2221 tggactccaa tgaatttgga 

ggtcacgtat 

2281 ggccagatgt gggtcccctc 

50 gttcttgaca 

2341 tgagagattc tcaagccagc 
aaggtgaggg 

2401 ccctgagtga gcacagaggg 
acggagtctg 
55 2461 gccaaccctg ctgagacttc 

acactgaagg 

2521 cccgtgcatt cctctcccag 
ggccttggtc 



cccctaccct 


cactaccaat 


cccatccccc 


ccaaccccac 


ccccaaaccc 


cattcccatc 


ggctttgccc 


ctgcaatcaa 


cccacggaag 


ctgacgttca 


catgtacggc 


taagggaggg 


gggatgcaga 


ggaagggccc 


aggcctcctg 


atgccaggac 


agggggccca 


ctgtacccct 


cgagggaatc 


ctagggatgc 


agacccactt 


agtcaagggg 


aggaagaaga 


gggaggactg 


ccttgggctg 


ggggatcctg 


ggcacagtgg 


agggtgacag 


agagttgagg 


gctgtggtct 


ggaatcccag 


gatctgccgg 


acccaaggtg 


cggcccagaa 


agaagggatg 


ccacagagtc 


acctgatcag 


ggatggccct 


aagtgacaat 


aaccctcagg 


gagataaggt 


gttggtgtaa 


gggggttgag 


aaagggcagt 


ccctggcagg 


cataacgttc 


accctagaac 


caaaggggtc 


gatgtggccc 


ctcctcactt 


gtctttccag 


aaggtgactc 


aggtcaacac 


aggggcccca 


ctgccaagca 


tccaggtgga 


gagcctgagg 


cagcaagggg 


gccccataga 


aatctgccct 


agggctgtca 


gctgaagtcc 


ctccattatc 


ccttggtctg 


aaggggctgg 


agtcaggtca 


gtggacgtga 


ggaccaagcg 


gactcgtcac 


catctctcgt 


tgtccttcgc 


gggaggacct 


atatccttcc 




w-y y y » y y & 


aaaagggtgg 


gattaggccc 


tacaaggaga 


gaccctccac 


ccaagtagag 


tggggacctc 


tgggaatccg 


tggctgtgct 


tgcagtctgc 


gaatcaggag 


ctccaggaac 


caggcagtga 



WO 03/008537 



174 



PCT/US02/10189 



2581 tgagtcagtg tcctcaggtc acagagcaga ggggacgcag acagtgccaa cactgaaggt
2641 ttgcctggaa tgcacaccaa gggccccacc cgcccagaac aaatgggact ccagagggcc
2701 tggcctcacc ctccctattc tcagtcctgc agcctgagca tgtgctggcc ggctgtaccc
2761 tgaggtgccc tcccacttcc tccttcaggt tctgaggggg acaggctgac aagtaggacc
2821 cgaggcactg gaggagcatt gaaggagaag atctgtaagt aagcctttgt cagagcctcc
2881 aaggttcagt tcagttctca cctaaggcct cacacacgct ccttctctcc ccaggcctgt
2941 gggtcttcat tgcccagctc ctgcccgcac tcctgcctgc tgccctgacc agagtcatca
3001 tgcctcttga gcagaggagt cagcactgca agcctgaaga aggccttgag gcccgaggag
3061 aggccctggg cctggtgggt gcgcaggctc ctgctactga ggagcagcag accgcttctt
3121 cctcttctac tctagtggaa gttaccctgg gggaggtgcc tgctgccgac tcaccgagtc
3181 ctccccacag tcctcaggga gcctccagct tctcgactac catcaactac actctttgga
3241 gacaatccga tgagggctcc agcaaccaag aagaggaggg gccaagaatg tttcccgacc
3301 tggagtccga gttccaagca gcaatcagta ggaagatggt tgagttggtt cattttctgc
3361 tcctcaagta tcgagccagg gagccggtca caaaggcaga aatgctggag agtgtcctca
3421 gaaattgcca ggacttcttt cccgtgatct tcagcaaagc ctccgagtac ttgcagctgg
3481 tctttggcat cgaggtggtg gaagtggtcc ccatcagcca cttgtacatc cttgtcacct
3541 gcctgggcct ctcctacgat ggcctgctgg gcgacaatca ggtcatgccc aagacaggcc
3601 tcctgataat cgtcctggcc ataatcgcaa tagagggcga ctgtgcccct gaggagaaaa
3661 tctgggagga gctgagtatg ttggaggtgt ttgaggggag ggaggacagt gtcttcgcac
3721 atcccaggaa gctgctcatg caagatctgg tgcaggaaaa ctacctggag taccggcagg
3781 tgcccggcag tgatcctgca tgctacgagt tcctgtgggg tccaagggcc ctcattgaaa
3841 ccagctatgt gaaagtcctg caccatacac taaagatcgg tggagaacct cacatttcct
3901 acccacccct gcatgaacgg gctttgagag agggagaaga gtgagtctca gcacatgttg
3961 cagccagggc cagtgggagg gggtctgggc cagtgcacct tccagggccc catccattag
4021 cttccactgc ctcgtgtgat atgaggccca ttcctgcctc tttgaagaga gcagtcagca
4081 ttcttagcag tgagtttctg ttctgttgga tgactttgag atttatcttt ctttcctgtt
4141 ggaattgttc aaatgttcct tttaacaaat ggttggatga acttcagcat ccaagtttat
4201 gaatgacagt agtcacacat agtgctgttt atatagttta ggggtaagag tcctgttttt
4261 tattcagatt gggaaatcca ttccattttg tgagttgtca cataataaca gcagtggaat
4321 atgtatttgc ctatattgtg aacgaattag cagtaaaata catgatacaa ggaactcaaa
4381 agatagttaa ttcttgcctt atacctcagt ctattatgta aaattaaaaa tatgtgtatg
4441 tttttgcttc tttgagaatg caaaagaaat taaatctgaa taaattcttc ctgttcactg
4501 gctcatttct ttaccattca ctcagcatct gctctgtgga aggccctggt agtagtggg
//



U03735. Human MAGE-3 anti... [gi:468825]

LOCUS HSU03735 4204 bp DNA PRI 07-APR-1994
DEFINITION Human MAGE-3 antigen (MAGE-3) gene, complete cds.
ACCESSION U03735
VERSION U03735.1 GI:468825
KEYWORDS
SOURCE human.
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4204)
AUTHORS Gaugler,B., Van den Eynde,B., van der Bruggen,P., Romero,P.,
Gaforio,J.J., De Plaen,E., Lethe,B., Brasseur,F. and Boon,T.
TITLE Human gene MAGE-3 codes for an antigen recognized on a melanoma by
autologous cytolytic T lymphocytes
JOURNAL J. Exp. Med. 179, 921-930 (1994)
MEDLINE 94157413
REFERENCE 2 (bases 1 to 4204)
AUTHORS Gaugler,B.
TITLE Direct Submission
JOURNAL Submitted (25-NOV-1993) Beatrice Gaugler, Ludwig Institute for
Cancer Research, 74 Avenue Hippocrate, Brussels 1200, Belgium
FEATURES Location/Qualifiers
source 1..4204
/organism="Homo sapiens"
/isolate="patient MZ2"
/db_xref="taxon:9606"
/chromosome="X"
/sex="female"
/cell_type="lymphocyte"
/tissue_type="blood"
/dev_stage="adult"
exon 434..468
/number=1
exon 2254..2319
/number=2
exon 2400..3978
/number=3
gene 2465..3409
/gene="MAGE-3"
CDS 2465..3409
/gene="MAGE-3"
/codon_start=1
/product="MAGE-3 antigen"
/protein_id="AAA17446.1"
/db_xref="GI:468826"
/translation="
MPLEQRSQHCKPEEGLEARGEALGLVGAQAPATEEQEAASSSSTLVEVTLGEVPAAESPD
PPQSPQGASSLPTTMNYPLWSQSYEDSSNQEEEGPSTFPDLESEFQAALSRKVAELVHFL
LLKYRAREPVTKAEMLGSVVGNWQYFFPVIFSKASSSLQLVFGIELMEVDPIGHLYIFAT
CLGLSYDGLLGDNQIMPKAGLLIIVLAIIAREGDCAPEEKIWEELSVLEVFEGREDSILG
DPKKLLTQHFVQENYLEYRQVPGSDPACYEFLWGPRALVETSYVKVLHHMVKISGGPHIS
YPPLHEWVLREGEE"
polyA_signal 3958..3963
polyA_site 3979
BASE COUNT 944 a 1144 c 1223 g 893 t
ORIGIN
1 acgcaggcag tgatgtcacc cagaccacac cccttccccc aatgccactt cagggggtac
61 tcagagtcag agacttggtc tgaggggagc agaagcaatc tgcagaggat ggcggtccag
121 gctcagccag gcatcaactt caggaccctg agggatgacc gaaggccccg cccacccacc
181 cccaactccc ccgaccccac caggatctac agcctcagga cccccgtccc aatccttacc
241 ccttgcccca tcaccatctt catgcttacc tccaccccca tccgatcccc atccaggcag
301 aatccagttc cacccctgcc cggaacccag ggtagtaccg ttgccaggat gtgacgccac
361 tgacttgcgc attggaggtc agaagaccgc gagattctcg ccctgagcaa cgagcgacgg
421 cctgacgtcg gcggagggaa gccggcccag gctcggtgag gaggcaaggt aagacgctga
481 gggaggactg aggcgggcct cacctcagac agagggcctc aaataatcca gtgctgcctc
541 tgctgccggg cctgggccac cccgcagggg aagacttcca ggctgggtcg ccactacctc
601 accccgccga cccccgccgc tttagccacg gggaactctg gggacagagc ttaatgtggc
661 cagggcaggg ctggttagaa gaggtcaggg cccacgctgt ggcaggaatc aaggtcagga
721 ccccgagagg gaactgaggg cagcctaacc accaccctca ccaccattcc cgtcccccaa
781 cacccaaccc cacccccatc ccccattccc atccccaccc ccacccctat cctggcagaa
841 tccgggcttt gcccctggta tcaagtcacg gaagctccgg gaatggcggc caggcacgtg
901 agtcctgagg ttcacatcta cggctaaggg agggaagggg ttcggtatcg cgagtatggc
961 cgttgggagg cagcgaaagg gcccaggcct cctggaagac agtggagtcc tgaggggacc
1021 cagcatgcca ggacaggggg cccactgtac ccctgtctca aaccgaggca ccttttcatt
1081 cggctacggg aatcctaggg atgcagaccc acttcagcag ggggttgggg cccagccctg
1141 cgaggagtca tggggaggaa gaagagggag gactgagggg accttggagt ccagatcagt
1201 ggcaaccttg ggctggggga tgctgggcac agtggccaaa tgtgctctgt gctcattgcg
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1261 ccttcagggt gaccagagag 
caggtcagca 

1321 gagggaggaa tcccaggatc 

ccctatgtgg 
5 1381 tggacagatg cagtggtcct 

tgagggagga 

1441 ttgagggtac ccctgggaca 
gccctgctcc 

1501 tgctgttacc tcagagagcc 
10 ttatcctagg 

1561 atcactgatg tcagggaagg 
gggcagtaga 

1621 gggaggctct cagaccctac 
ctcacccagg 
15 1681 gtacatggac ttcaataaat 

gacctgggaa 

1741 tgtatggcca gatgtgggtc 
tgtgagttct 

1801 tgacatgaga gattctcagg 
20 ggagaaaggt 

1861 gagggccctg agtgagcaca 
acctcacaga 

1921 gtctggccaa ccctcctgac 
tctgcacatt 

25 1981 gggggcccgt ggattcctct 

agtgaggact 

2041 tggtctgagg cagtgtcctc 
gccaacggtg 

2101 aaggtttgcc ttggattcaa 
30 ggactccaga 

2161 gcgcctggcc tcaccctcaa 
tggccggatg 

2221 taccctgagg tgccctctca 
tgacctggag 
35 2281 gaccagaggc ccccggagga 

tttgttagag 

2341 cctccaaggt tccattcagt 
tctccccagg 

2401 ccagtgggtc tccattgccc 
40 tgaccagagt 

2461 catcatgcct cttgagcaga 
ttgaggcccg 

2521 aggagaggcc ctgggcctgg 
aggaggctgc 

45 2581 ctcctcctct tctactctag 

ccgagtcacc 

2641 agatcctccc cagagtcctc 
actaccctct 

2701 ctggagccaa tcctatgagg 
50 gcaccttccc 

2761 tgacctggag tccgagttcc 
tggttcattt 

2821 tctgctcctc aagtatcgag 
tggggagtgt 

55 2881 cgtcggaaat tggcagtatt 

gttccttgca 

2941 gctggtcttt ggcatcgagc 
acatctttgc 



ttgagygctg 


t" rrrff o> ^ on a or 

ty y t c t g day 


a off- of rjoa fi" t* 

a.y uyyy ctu u «- 


t* fir* a nni*i o o* o 


r» r> or or frrt" app 
ddyy ty uauu 


u^Laayyyyu 


ay y d t t t y Ct 


a a.oto»a trna or 
ddy u>clu t uay 


off- or a aorarrjaj r» 


<~rza a fr* oro»orora f~* 

yaauyCygaU 


4- or om or or r* o* o o» 

tgyyyytttt; 


af-a^jaaaf'o't" 
cl Laaaacx Ltu 


ototoi oia ototoi o» 

tyyy tayyy t 


tytcoyvtya 




ggaagcct eg 


guccgagggg 




taggagcgga 


ggugaggacc 


a a or a n 4~ ^ 

aagcagtCLC 


ttggacatct 


CCCyC uy lCC 


^ 4~ /^orrrrrt a or 

ut.cc cy gg ay 


******** 4^ a 4™ 4— 4* 

cccccacgcc 


f* 4~ i 1 ^ « *^ 

ULtCCyCaCC 


afaf oanrtf a 
auaLLaygud 


ccagcagaag 


gy aggga t ua 


ororo o»o< fsfas 
yyLuc uaUad 


gaggggatcc 


CLCaCCCC/dy 


+- o or a or otototot 


agccccggga 


aLCcg uygcc 


oroor t** t" f" oro« +" or 
ycy ttty uty 


cccaggaatc 


aggagcucca 


/*Y/—w a a ^ a a f^r\ o* 

ggaacaaggc 


aggccacaga 


gcagaggggg 


o»t~ oiaora a oft* 
1 1 v-oy a. tcty t 


accaagggcc 


UUauvLyuCL 


riarra a o"3 o»a t - 
wCLy CldU>CLUCl u 


4- q 4>» 4— riarrf" 

t act t t cayt 


/-t /-i cyst anpp 
ul> L.y ccty l. 


uay^ciuyuyo 




cagguucuga 


ororOrrT a o> a orof o 1 


gcactgaagg 


agaagacccg 


taagtaag cc 


actcagctga 


ggcctctcac 


atgctccctc 


agctcctgcc 


cacacccccg 


cccgucgccu 


ggagtcagca 


*t ^™ **t ** *** ** **% 4^ 

ccgcaagccc 


y aagaaggc c 


tgggtgcgca 


ggcccctgcc 


a nHoa nna /to* 

act gag gage 


cugaagccac 


cctgggggag 


oif" Oro* O't* oroi f" or 

y ty tt> tyt'ty 


agggagcctc 


cagccucccc 


a /"^^ a o*/"i a 4~ /*ra 

ac tat cat ga 


actccagcaa 


ccaagaagag 


gaggggecaa 


aagcagcact 


cagtaggaag 


gtggccgagt 


ccagggagcc 


ggtcacaaag 


geagaaatge 


tctttcctgt 


gatcttcagc 

* 


aaagcttcca 


tgatggaagt 


ggaccccatc 


ggccacttgt 
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3001 cacctgcctg ggcctctcct acgatggcct gctgggtgac aatcagatca 
tgcccaaggc 

3061 aggcctcctg ataatcgtcc tggccataat cgcaagagag ggcgactgtg cccctgagga
3121 gaaaatctgg gaggagctga gtgtgttaga ggtgtttgag gggagggaag acagtatctt
3181 gggggatccc aagaagctgc tcacccaaca tttcgtgcag gaaaactacc tggagtaccg
3241 gcaggtcccc ggcagtgatc ctgcatgtta tgaattcctg tggggtccaa gggccctcgt
3301 tgaaaccagc tatgtgaaag tcctgcacca tatggtaaag atcagtggag gacctcacat
3361 ttcctaccca cccctgcatg agtgggtttt gagagagggg gaagagtgag tctgagcacg
3421 agttgcagcc agggccagtg ggagggggtc tgggccagtg caccttccgg ggccgcatcc
3481 cttagtttcc actgcctcct gtgacgtgag gcccattctt cactctttga agcgagcagt
3541 cagcattctt agtagtgggt ttctgttctg ttggatgact ttgagattat tctttgtttc
3601 ctgttggagt tgttcaaatg ttccttttaa cggatggttg aatgagcgtc agcatccagg
3661 tttatgaatg acagtagtca cacatagtgc tgtttatata gtttaggagt aagagtcttg
3721 ttttttactc aaattgggaa atccattcca ttttgtgaat tgtgacataa taatagcagt
3781 ggtaaaagta tttgcttaaa attgtgagcg aattagcaat aacatacatg agataactca
3841 agaaatcaaa agatagttga ttcttgcctt gtacctcaat ctattctgta aaattaaaca
3901 aatatgcaaa ccaggatttc cttgacttct ttgagaatgc aagcgaaatt aaatctgaat
3961 aaataattct tcctcttcac tggctcgttt cttttccgtt cactcagcat ctgctctgtg
4021 ggaggccctg ggttagtagt ggggatgcta aggtaagcca gactcacgcc tacccatagg
4081 gctgtagagc ctaggacctg cagtcatata attaaggtgg tgagaagtcc tgtaagatgt
4141 agaggaaatg taagagaggg gtgagggtgt ggcgctccgg gtgagagtag tggagtgtca
4201 gtgc

// 

AF043498. Homo sapiens pros... [gi:2909843]

LOCUS AF043498 990 bp mRNA PRI 24-FEB-1998
DEFINITION Homo sapiens prostate stem cell antigen (PSCA) mRNA, complete cds.
ACCESSION AF043498
VERSION AF043498.1 GI:2909843
KEYWORDS
SOURCE human.
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 990)
AUTHORS Reiter,R.E., Gu,Z., Watabe,T., Thomas,G., Kinga,S., Davis,E.,
Wahl,M., Nisitani,S., Yamashiro,J., Le Beau,M.M., Losa,M. and
Witte,O.N.
TITLE Prostate stem cell antigen: a cell surface marker overexpressed in
prostate cancer
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 95 (4), 1735-1740 (1998)
MEDLINE 98132661
REFERENCE 2 (bases 1 to 990)
AUTHORS Reiter,R.E.
TITLE Direct Submission
JOURNAL Submitted (19-JAN-1998) Urology, UCLA, 66-134 CHS 10833 Le Conte
Ave., Los Angeles, CA 90095, USA
FEATURES Location/Qualifiers
source 1..990
/organism="Homo sapiens"
/db_xref="taxon:9606"
/chromosome="8"
/map="8q24.2"
/note="LAP04 prostate cancer xenograft"
gene 1..990
/gene="PSCA"
CDS 18..389
/gene="PSCA"
/note="GPI-anchored cell surface protein"
/codon_start=1
/product="prostate stem cell antigen"
/protein_id="AAC39607.1"
/db_xref="GI:2909844"
/translation="
MKAVLLALLMAGLALQPGTALLCYSCKAQVSNEDCLQVENCTQLGEQCWTARIRAVGLLT
VISKGCSLNCVDDSQDYYVGKKNITCCDTDLCNASGAHALQPAAAILALLPALGLLLWGP
GQL"
BASE COUNT 193 a 299 c 285 g 202 t 11 others
ORIGIN
1 agggagaggc agtgaccatg aaggctgtgc tgcttgccct gttgatggca ggcttggccc
61 tgcagccagg cactgccctg ctgtgctact cctgcaaagc ccaggtgagc aacgaggact
121 gcctgcaggt ggagaactgc acccagctgg gggagcagtg ctggaccgcg cgcatccgcg
181 cagttggcct cctgaccgtc atcagcaaag gctgcagctt gaactgcgtg gatgactcac
241 aggactacta cgtgggcaag aagaacatca cgtgctgtga caccgacttg tgcaacgcca
301 gcggggccca tgccctgcag ccggctgccg ccatccttgc gctgctccct gcactcggcc
361 tgctgctctg gggacccggc cagctatagg ctctgggggg ccccgctgca gcccacactg
421 ggtgtggtgc cccaggcctt tgtgccactc ctcacagaac ctggcccagt gggagcctgt
481 cctggttcct gaggcacatc ctaacgcaag tttgaccatg tatgtttgca ccccttttcc
541 ccnaaccctg accttcccat gggccttttc caggattccn accnggcaga tcagttttag
601 tganacanat ccgcntgcag atggcccctc caaccntttn tgttgntgtt tccatggccc
661 agcattttcc acccttaacc ctgtgttcag gcacttnttc ccccaggaag ccttccctgc
721 ccaccccatt tatgaattga gccaggtttg gtccgtggtg tcccccgcac ccagcagggg
781 acaggcaatc aggagggccc agtaaaggct gagatgaagt ggactgagta gaactggagg
841 acaagagttg acgtgagttc ctgggagttt ccagagatgg ggcctggagg cctggaggaa
901 ggggccaggc ctcacatttg tggggntccc gaatggcagc ctgagcacag cgtaggccct
961 taataaacac ctgttggata agccaaaaaa
//

P06870. GLANDULAR KALLIKR... [gi:125170]

LOCUS KLK1_HUMAN 262 aa PRI 20-AUG-2001
DEFINITION GLANDULAR KALLIKREIN 1 PRECURSOR (TISSUE KALLIKREIN)
(KIDNEY/PANCREAS/SALIVARY GLAND KALLIKREIN).
ACCESSION P06870
PID g125170
VERSION P06870 GI:125170
DBSOURCE swissprot: locus KLK1_HUMAN, accession P06870;
class: standard.
extra accessions:Q9UMJ1, created: Jan 1, 1988.
sequence updated: Jan 1, 1988.
annotation updated: Aug 20, 2001.
xrefs: gi: gi: 186652, gi: gi: 186653, gi: gi: 186649, gi: gi:
186651, gi: gi: 186645, gi: gi: 186646, gi: gi: 186647, gi: gi:
186648, gi: gi: 34026, gi: gi: 34027, gi: gi: 186643, gi: gi:
386843, gi: gi: 67558
xrefs (non-sequence databases): HSSP P00757, MEROPS S01.160,
GlycoSuiteDB P06870, MIM 147910, InterPro IPR001314, InterPro
IPR001254, Pfam PF00089, PRINTS PR00722, PROSITE PS50240, PROSITE
PS00134, PROSITE PS00135
KEYWORDS Hydrolase; Serine protease; Glycoprotein; Multigene family;
Zymogen; Signal.
SOURCE human.
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

REFERENCE 1 (residues 1 to 262)
AUTHORS Fukushima,D., Kitamura,N. and Nakanishi,S.
TITLE Nucleotide sequence of cloned cDNA for human pancreatic kallikrein
JOURNAL Biochemistry 24, 8037-8043 (1985)
REMARK SEQUENCE FROM N.A.
TISSUE=Pancreas
REFERENCE 2 (residues 1 to 262)
AUTHORS Evans,B.A., Yun,Z.X., Close,J.A., Tregear,G.W., Kitamura,N.,
Nakanishi,S., Callen,D.F., Baker,E., Hyland,V.J., Sutherland,G.R.
and Richards,R.I.
TITLE Structure and chromosomal localization of the human renal
kallikrein gene
JOURNAL Biochemistry. 27 (9), 3124-3129 (1988)
MEDLINE 88269498
PUBMED 2898948
REMARK SEQUENCE FROM N.A.
TISSUE=Kidney
REFERENCE 3 (residues 1 to 262)
AUTHORS Angermann,A., Bergmann,C. and Appelhans,H.
TITLE Cloning and expression of human salivary-gland kallikrein in
Escherichia coli
JOURNAL The Biochemical journal. 262 (3), 787-793 (1989)
MEDLINE 90073574
PUBMED 2686621
REMARK SEQUENCE FROM N.A.
TISSUE=Salivary gland
REFERENCE 4 (residues 1 to 262)
AUTHORS Baker,A.R. and Shine,J.
TITLE Human kidney kallikrein: cDNA cloning and sequence analysis
JOURNAL DNA (Mary Ann Liebert, Inc.) 4 (6), 445-450 (1985)
MEDLINE 86135264
PUBMED 3853975
REMARK SEQUENCE OF 17-262 FROM N.A.
TISSUE=Kidney
REFERENCE 5 (residues 1 to 262)
AUTHORS Lu,H.S., Lin,F.K., Chao,L. and Chao,J.
TITLE Human urinary kallikrein. Complete amino acid sequence and sites of
glycosylation
JOURNAL International journal of peptide and protein research. 33 (4),
237-249 (1989)
MEDLINE 89326688
PUBMED 2666327
REMARK SEQUENCE OF 25-262.
TISSUE=Urine
REFERENCE 6 (residues 1 to 262)
AUTHORS Kellermann,J., Lottspeich,F., Geiger,R. and Deutzmann,R.
TITLE Human urinary kallikrein--amino acid sequence and carbohydrate
attachment sites
JOURNAL Protein sequences & data analysis. 1 (3), 177-182 (1988)
MEDLINE 88203586
PUBMED 3163150
REMARK SEQUENCE OF 25-262, AND CARBOHYDRATE-LINKAGE SITES.
TISSUE=Urine
REFERENCE 7 (residues 1 to 262)
AUTHORS Lottspeich,F., Geiger,R., Henschen,A. and Kutzbach,C.
TITLE N-Terminal amino acid sequence of human urinary kallikrein homology
with other serine proteases
JOURNAL Hoppe-Seyler's Zeitschrift fur physiologische Chemie. 360 (12),
1947-1950 (1979)
MEDLINE 80114126
PUBMED 393608
REMARK SEQUENCE OF 25-55.
TISSUE=Urine
REFERENCE 8 (residues 1 to 262)
AUTHORS Takahashi,S., Irie,A., Katayama,Y., Ito,K. and Miyake,Y.
TITLE N-terminal amino acid sequence of human urinary prokallikrein
JOURNAL Journal of biochemistry. 99 (3), 989-992 (1986)
MEDLINE 86223893
PUBMED 3635530
REMARK SEQUENCE OF 28-47.
TISSUE=Urine
COMMENT [FUNCTION] GLANDULAR KALLIKREINS CLEAVE MET-LYS AND ARG-SER BONDS
IN KININOGEN TO RELEASE LYS-BRADYKININ.
[CATALYTIC ACTIVITY] PREFERENTIAL CLEAVAGE OF ARG-|-XAA BONDS IN
SMALL MOLECULE SUBSTRATES. HIGHLY SELECTIVE ACTION TO RELEASE
KALLIDIN (LYSYL-BRADYKININ) FROM KININOGEN INVOLVES HYDROLYSIS OF
MET-|-XAA OR LEU-|-XAA.
[SIMILARITY] BELONGS TO PEPTIDASE FAMILY S1; ALSO KNOWN AS THE
TRYPSIN FAMILY. KALLIKREIN SUBFAMILY.



FEATURES             Location/Qualifiers
     source          1..262
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     Protein         1..262
                     /product="GLANDULAR KALLIKREIN 1 PRECURSOR"
                     /EC_number="3.4.21.35"
     Region          1..18
                     /region_name="Signal"
                     /note="PROBABLE."
     Region          19..24
                     /region_name="Propeptide"
                     /note="ACTIVATION PEPTIDE (PROBABLE)."
     Region          25..262
                     /region_name="Mature chain"
                     /note="GLANDULAR KALLIKREIN 1."
     Bond            bond(31,174)
                     /bond_type="disulfide"
                     /note="BY SIMILARITY."
     Bond            bond(50,66)
                     /bond_type="disulfide"
                     /note="BY SIMILARITY."
     Site            65
                     /site_type="active"
                     /note="CHARGE RELAY SYSTEM."
     Site            93
                     /site_type="glycosylation"
                     /note="O-LINKED."
     Site            102
                     /site_type="glycosylation"
                     /note="N-LINKED (GLCNAC...)."
     Site            104
                     /site_type="glycosylation"
                     /note="O-LINKED."
     Site            108
                     /site_type="glycosylation"
                     /note="N-LINKED (GLCNAC...)."
     Site            120
                     /site_type="active"
                     /note="CHARGE RELAY SYSTEM."
     Region          145
                     /region_name="Variant"
                     /note="Q -> E. /FTId=VAR_006625."
     Bond            bond(153,220)
                     /bond_type="disulfide"
                     /note="BY SIMILARITY."
     Site            165
                     /site_type="glycosylation"
                     /note="N-LINKED (GLCNAC...) (PARTIAL)."
     Site            167
                     /site_type="glycosylation"
                     /note="O-LINKED."
     Bond            bond(185,199)
                     /bond_type="disulfide"
                     /note="BY SIMILARITY."
     Region          186
                     /region_name="Variant"
                     /note="E -> K. /FTId=VAR_006626."
     Bond            bond(210,235)
                     /bond_type="disulfide"
                     /note="BY SIMILARITY."
     Site            214
                     /site_type="active"
                     /note="CHARGE RELAY SYSTEM."



ORIGIN 

   1 mwflvlclal slggtgaapp iqsrivggwe ceqhsqpwqa alyhfstfqc ggilvhrqwv
  61 ltaahcisdn yqlwlgrlanl fddentaqfv bVBesfpbpg fnmsllenht rqadedyahd
 121 lmllrltepa dtitdavkw elptqepevg stclasgwgs iepenfsfpd dlqcvdlkil
 181 pndecekahv qkvtdfmlcv ghleggkdtc vgdeggplmc dgvlqgvtsw gyvpcgtpnk
 241 psvavrvlsy vkwiedtiae ns

// 



P08217. ELASTASE 2A PRECU... [gi:119255]
LOCUS       EL2A_HUMAN 269 aa PRI 20-AUG-2001
DEFINITION  ELASTASE 2A PRECURSOR.
ACCESSION   P08217
PID         g119255
VERSION     P08217 GI:119255
DBSOURCE    swissprot: locus EL2A_HUMAN, accession P08217;
            class: standard.
            created: Aug 1, 1988.
            sequence updated: Aug 1, 1988.
            annotation updated: Aug 20, 2001.
            xrefs: gi: gi: 182022, gi: gi: 182023, gi: gi: 182057, gi: gi:
            182058, gi: gi: 88298, gi: gi: 88299
            xrefs (non-sequence databases): MEROPS S01.155, InterPro
            IPR001314, InterPro IPR001254, Pfam PF00089, PRINTS PR00722,
            PROSITE PS50240, PROSITE PS00134, PROSITE PS00135
KEYWORDS    Hydrolase; Serine protease; Pancreas; Zymogen; Signal.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

REFERENCE   1 (residues 1 to 269)
  AUTHORS   Kawashima,I., Tani,T., Shimoda,K. and Takiguchi,Y.
  TITLE     Characterization of pancreatic elastase II cDNAs: two elastase II
            mRNAs are expressed in human pancreas
  JOURNAL   DNA 6 (2), 163-172 (1987)
  MEDLINE   87217962
  REMARK    SEQUENCE FROM N.A.
REFERENCE   2 (residues 1 to 269)
  AUTHORS   Fletcher,T.S., Shen,W.F. and Largman,C.
  TITLE     Primary structure of human pancreatic elastase 2 determined by
            sequence analysis of the cloned mRNA
  JOURNAL   Biochemistry 26 (23), 7256-7261 (1987)
  MEDLINE   88107669
  REMARK    SEQUENCE FROM N.A.
            [FUNCTION] ACTS UPON ELASTIN.
            [CATALYTIC ACTIVITY] PREFERENTIAL CLEAVAGE: LEU-|-XAA, MET-|-XAA
            AND PHE-|-XAA. HYDROLYSES ELASTIN.
            [SUBCELLULAR LOCATION] SECRETED.
            [TISSUE SPECIFICITY] PANCREAS.

            [SIMILARITY] BELONGS TO PEPTIDASE FAMILY S1; ALSO KNOWN AS THE
            TRYPSIN FAMILY. ELASTASE SUBFAMILY.
FEATURES             Location/Qualifiers
     source          1..269
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     Region          1..16
                     /region_name="Signal"
     Protein         1..269
                     /product="ELASTASE 2A PRECURSOR"
                     /EC_number="3.4.21.71"
     Region          17..28
                     /region_name="Propeptide"
                     /note="ACTIVATION PEPTIDE."
     Region          29..269
                     /region_name="Mature chain"
                     /note="ELASTASE 2A."
     Bond            bond(58,74)
                     /bond_type="disulfide"
                     /note="BY SIMILARITY."
     Site            73
                     /site_type="active"
                     /note="CHARGE RELAY SYSTEM (BY SIMILARITY)."
     Site            121
                     /site_type="active"
                     /note="CHARGE RELAY SYSTEM (BY SIMILARITY)."
     Bond            bond(155,222)
                     /bond_type="disulfide"
                     /note="BY SIMILARITY."
     Bond            bond(186,202)
                     /bond_type="disulfide"
                     /note="BY SIMILARITY."
     Bond            bond(212,243)
                     /bond_type="disulfide"
                     /note="BY SIMILARITY."
     Site            216
                     /site_type="active"
                     /note="CHARGE RELAY SYSTEM (BY SIMILARITY)."



ORIGIN 

   1 mirtlllstl vagalscgdp typpyvtrw ggeearpnsw pwqvslqyss ngkvjyhtcgg
  61 slianswvlt aahcisssrt yrvglgrhnl yvaesgslav svskiwhkd wnsngiskgn
 121 diallklanp vsltdkigla clppagtilp nnypcyvtgw grlqtngavp dvlqggrllv
 181 vdyatcsssa wwgssvktsm icaggdgvis scngdsggpl ncgasdgrwg vhgivsfgsr
 241 lgcnyyhkps vftrvsnyid winaviann

// 



NP_056933. pancreatic elasta... [gi:7705648]
LOCUS       NP_056933 269 aa PRI 02-NOV-2000
DEFINITION  pancreatic elastase IIB [Homo sapiens].
ACCESSION   NP_056933
PID         g7705648
VERSION     NP_056933.1 GI:7705648
DBSOURCE    REFSEQ: accession NM_015849.1
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

REFERENCE   1 (residues 1 to 269)
  AUTHORS   Kawashima,I., Tani,T., Shimoda,K. and Takiguchi,Y.
  TITLE     Characterization of pancreatic elastase II cDNAs: two elastase II
            mRNAs are expressed in human pancreas
  JOURNAL   DNA 6 (2), 163-172 (1987)
  MEDLINE   87217962
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from M16653.1.
FEATURES             Location/Qualifiers
     source          1..269
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="12"
                     /map="12"
     Protein         1..269
                     /product="pancreatic elastase IIB"
     sig_peptide     1..16
                     /note="pancreatic elastase IIB signal peptide"
     Region          28..262
                     /region_name="Trypsin-like serine protease"
                     /db_xref="CDD:Tryp_SPc"
                     /note="Tryp_SPc"
     mat_peptide     29..269
                     /product="pancreatic elastase IIB mature peptide"
     Region          31..262
                     /region_name="Trypsin"
                     /db_xref="CDD:pfam00089"
                     /note="trypsin"
     CDS             1..269
                     /gene="LOC51032"
                     /db_xref="LocusID:51032"
                     /coded_by="NM_015849.1:26..835"

ORIGIN 

   1 mirtlllstl vagalscgvs tyapdmsrml ggeearpnsw pwqvslqyss ngqwyhtcgg
  61 slianswvlt aahcisssri yrvmlgqhnl yvaesgslav svskiwhkd wnsnqyskgn
 121 diallklanp vsltdkiqla clppagtilp nnypcyvtgw grlqtngalp ddlkqgrllv
 181 vdyatcsssg wwgstvktnm icaggdgvic tcngdsggpl ncqasdgrwe vhgigsltsv
 241 lgcnyyykps iftrvsnynd winsviann

// 



PRAME 

LOCUS       NM_006115 2148 bp mRNA PRI 19-JUN-2001
DEFINITION  Homo sapiens preferentially expressed antigen in melanoma (PRAME),
            mRNA.
ACCESSION   NM_006115
VERSION     NM_006115.1 GI:5174640
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

REFERENCE   1 (bases 1 to 2148)
  AUTHORS   Ikeda,H., Lethe,B., Lehmann,F., Van Baren,N., Baurain,J.-F., De
            Smet,C., Chambost,H., Vitale,M., Moretta,A., Boon,T. and
            Coulie,P.G.
  TITLE     Characterization of an antigen that is recognized on a melanoma
            showing partial HLA loss by CTL expressing an NK inhibitory
            receptor
  JOURNAL   Immunity 6 (2), 199-208 (1997)
  MEDLINE   97199265
REFERENCE   2 (bases 1 to 2148)
  AUTHORS   Williams JM, Chen GC, Zhu L and Rest RF.
  TITLE     Using the yeast two-hybrid system to identify human epithelial cell
            proteins that bind gonococcal Opa proteins: intracellular gonococci
            bind pyruvate kinase via their Opa proteins and require host
            pyruvate for growth
  JOURNAL   Mol. Microbiol. 27 (1), 171-186 (1998)
  MEDLINE   98125741
  PUBMED    9466265
REFERENCE   3 (bases 1 to 2148)
  AUTHORS   van Baren,N., Chambost,H., Ferrant,A., Michaux,L., Ikeda,H.,
            Millard,I., Olive,D., Boon,T. and Coulie,P.G.
  TITLE     PRAME, a gene encoding an antigen recognized on a human melanoma by
            cytolytic T cells, is expressed in acute leukaemia cells
  JOURNAL   Br. J. Haematol. 102 (5), 1376-1379 (1998)
  MEDLINE   98423996
  PUBMED    9753074
REFERENCE   4 (bases 1 to 2148)
  AUTHORS   Dunham I, Shimizu N, Roe BA, Chissoe S, Hunt AR, Collins JE,
            Bruskiewich R, Beare DM, Clamp M, Smink LJ, Ainscough R, Almeida
            JP, Babbage A, Bagguley C, Bailey J, Barlow K, Bates KN, Beasley O,
            Bird CP, Blakey S, Bridgeman AM, Buck D, Burgess J, Burrill WD,
            O'Brien KP and et al.
  TITLE     The DNA sequence of human chromosome 22
  JOURNAL   Nature 402 (6761), 489-495 (1999)
  MEDLINE   20057165
  PUBMED    10591208
  REMARK    Erratum:[[published erratum appears in Nature 2000 Apr
            20;404(6780):904]]
COMMENT     REVIEWED REFSEQ: This record has been curated by NCBI staff. The
            reference sequence was derived from U65011.1.
            Summary: The protein encoded by this gene has a 509 amino acid
            antigen, lacking a signal sequence, and recognized on a human
            melanoma cell line by a T-lymphocyte clone. A significant level of
            this mRNA is detected in normal testis as well as in many
            melanomas, non-small cell lung carcinomas, sarcomas, head and neck
            tumors and renal carcinomas. The encoded protein is expressed
            predominantly in acute leukemias carrying chromosomal abnormalities
            such as translocation t(8:21), which fuses the AML1 and ETO genes.
            Its expression shares several characteristics with the expression
            patterns of MAGE, BAGE, and GAGE gene families, all of which are
            expressed in tumors. This protein is expressed in a higher
            proportion of samples than genes of the MAGE, BAGE, and GAGE
            families, and of these four groups, only this protein is expressed
            by acute myeloid leukemias.
            COMPLETENESS: complete on the 3' end.
FEATURES             Location/Qualifiers
     source          1..2148
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="22"
                     /map="22q11.22"
                     /haplotype="A24 A28 B13 B44 Cw6 Cw7"
     gene            1..2148
                     /gene="PRAME"
                     /note="MAPE; OIP4"
                     /db_xref="LocusID:23532"
                     /db_xref="MIM:606021"
     CDS             236..1765
                     /gene="PRAME"
                     /note="melanoma antigen preferentially expressed in
                     tumors; Opa-interacting protein OIP4"
                     /codon_start=1
                     /db_xref="LocusID:23532"
                     /db_xref="MIM:606021"
                     /product="preferentially expressed antigen of melanoma"
                     /protein_id="NP_006106.1"
                     /db_xref="GI:5174641"
                     /translation="MERRRLWGSIQSRYISMSVWSPRRLV^
                     FPPLFMAAFDGRHSQTLKAWQAWPFTCLPLGVIMKGQ^
                     LQVLDLRKNSHQDFWTVWSGNRASLYSFPEPEAAQPMTK
                     EGACDELFSYLIEKVKRKKNVLRDCCKKLKI
                     KFSPYLGQMINLRRLLLSHIHASSYISPEKEEQYIAQFTSQFLSLQCLQALYVDSLFFLRGRLDQL
                     LRHVMNPLETLSITNCRLSEGDVMHLSQSPSVSQLSVLSLSGVMLTDVSPEPLQALLERASATLQD
                     LVFDECGITDDQLLALLPSLSHCSQLTTLSFYGNSISISALQSLLQHLIGLSNLTHVLYPVPLESY
                     EDIHGTLHLERLAYLHARLRELLCELGRPSMVWLSANPCPHCGDRTFYDPEPILCPCFMPN"
     variation       254
                     /allele="C"
                     /allele="T"
                     /db_xref="dbSNP:1129172"
     variation       698
                     /allele="C"
                     /allele="A"
                     /db_xref="dbSNP:1063107"
     variation       1001
                     /allele="C"
                     /allele="T"
                     /db_xref="dbSNP:2229695"
     variation       1498
                     /allele="T"
                     /allele="C"
                     /db_xref="dbSNP:2229696"
     variation       1762
                     /allele="C"
                     /allele="T"
                     /db_xref="dbSNP:13604"
     variation       1952
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:7104"
     variation       1984
                     /allele="A"
                     /allele="C"
                     /db_xref="dbSNP:8405"
     variation       2043
                     /allele="T"
                     /allele="A"
                     /db_xref="dbSNP:1804685"
     polyA_signal    2108..2113
     polyA_site      2130
BASE COUNT  534 a 548 c 558 g 508 t

ORIGIN 

   1 gcttcagggt acagctcccc cgcagccaga agccgggcct gcagcccctc agcaccgctc
  61 cgggacaccc cacccgcttc ccaggcgtga cctgtcaaca gcaacttcgc ggtgtggtga
 121 actctctgag gaaaaaccat tttgattatt actctcagac gtgcgtggca acaagtgact
 181 gagacctaga aatccaagcg ttggaggtcc tgaggccagc ctaagtcgct tcaaaatgga
 241 acgaaggcgt ttgtggggtt ccattcagag ccgatacatc agcatgagtg tgtggacaag
 301 cccacggaga cttgtggagc tggcagggca gagcctgctg aaggatgagg ccctggccat
 361 tgccgccctg gagttgctgc ccagggagct cttcccgcca ctcttcatgg cagcctttga
 421 cgggagacac agccagaccc tgaaggcaat ggtgcaggcc tggcccttca cctgcctccc
 481 tctgggagtg ctgatgaagg gacaacatct tcacctggag accttcaaag ctgtgcttga
 541 tggacttgat gtgctccttg cccaggaggt tcgccccagg aggtggaaac ttcaagtgct
 601 ggatttacgg aagaactctc atcaggactt ctggactgta tggtctggaa acagggccag
 661 tctgtactca tttccagagc cagaagcagc tcagcccatg acaaagaagc gaaaagtaga
 721 tggtttgagc acagaggcag agcagccctt cattccagta gaggtgctcg tagacctgtt
 781 cctcaaggaa ggtgcctgtg atgaattgtt ctcctacctc attgagaaag tgaagcgaaa
 841 gaaaaatgta ctacgcctgt gctgtaagaa gctgaagatt tttgcaatgc ccatgcagga
 901 tatcaagatg atcctgaaaa tggtgcagct ggactctatt gaagatttgg aagtgacttg
 961 tacctggaag ctacccacct tggcgaaatt ttctccttac ctgggccaga tgattaatct
1021 gcgtagactc ctcctctccc acatccatgc atcttcctac atttccccgg agaaggaaga
1081 gcagtatatc gcccagttca cctctcagtt cctcagtctg cagtgcctgc aggctctcta
1141 tgtggactct ttatttttcc ttagaggccg cctggatcag ttgctcaggc acgtgatgaa
1201 ccccttggaa accctctcaa taactaactg ccggctttcg gaaggggatg tgatgcatct
1261 gtcccagagt cccagcgtca gtcagctaag tgtcctgagt ctaagtgggg tcatgctgac
1321 cgatgtaagt cccgagcccc tccaagctct gctggagaga gcctctgcca ccctccagga
1381 cctggtcttt gatgagtgtg ggatcacgga tgatcagctc cttgccctcc tgccttccct
1441 gagccactgc tcccagctta caaccttaag cttctacggg aattccatct ccatatctgc
1501 cttgcagagt ctcctgcagc acctcatcgg gctgagcaat ctgacccacg tgctgtatcc
1561 tgtccccctg gagagttatg aggacatcca tggtaccctc cacctggaga ggcttgccta
1621 tctgcatgcc aggctcaggg agttgctgtg tgagttgggg cggcccagca tggtctggct
1681 tagtgccaac ccctgtcctc actgtgggga cagaaccttc tatgacccgg agcccatcct
1741 gtgcccctgt ttcatgccta actagctggg tgcacatatc aaatgcttca ttctgcatac
1801 ttggacacta aagccaggat gtgcatgcat cttgaagcaa caaagcagcc acagtttcag
1861 acaaatgttc agtgtgagtg aggaaaacat gttcagtgag gaaaaaacat tcagacaaat
1921 gttcagtgag gaaaaaaagg ggaagttggg gataggcaga tgttgacttg aggagttaat
1981 gtgatctttg gggagataca tcttatagag ttagaaatag aatctgaatt tctaaaggga
2041 gattctggct tgggaagtac atgtaggagt taatccctgt gtagactgtt gtaaagaaac
2101 tgttgaaaat aaagagaagc aatgtgaagc aaaaaaaaaa aaaaaaaa

ED-B domain of Fibronectin

LOCUS       HSFIBEDB 2823 bp DNA linear PRI 09-AUG-1999
DEFINITION  Human fibronectin gene ED-B region.
ACCESSION   X07717
VERSION     X07717.1 GI:31406
KEYWORDS    alternate splicing; fibronectin.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

REFERENCE   1 (bases 1 to 2823)
  AUTHORS   Paolella,G., Henchcliffe,C., Sebastio,G. and Baralle,F.E.
  TITLE     Sequence analysis and in vivo expression show that alternative
            splicing of ED-B and ED-A regions of the human fibronectin gene are
            independent events
  JOURNAL   Nucleic Acids Res. 16 (8), 3545-3557 (1988)
  MEDLINE   88233940
FEATURES             Location/Qualifiers
     source          1..2823
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="MA10"
     exon            1..104
                     /number=1
                     /product="fibronectin"
     CDS             join(<2..104,1375..1647,2758..>2823)
                     /codon_start=1
                     /product="fibronectin"
                     /protein_id="CAB52437.1"
                     /db_xref="GI:5725425"
                     /translation="CTFDNLSPGLEYNVSVYTVKDDKESVPISDTIIPEVPQLTDLSF
                     VDITDSSIGLRWTPLNSSTIIGYRITVVAAGEGIPIFEDFVDSSVGYYTVTGLEPGID
                     YDISVITLINGGESAPTTLTQQTAVPPPTDLRFTNIGPDTMRVTW"
     intron          105..1374
                     /number=1
     exon            1375..1647
                     /note="ED-B exon"
                     /number=2
                     /product="fibronectin"
     intron          1648..2757
                     /number=2
     exon            2758..2823
                     /number=3
                     /product="fibronectin"
BASE COUNT  824 a 556 c 528 g 915 t

ORIGIN 

   1 ctgcactttt gataacctga gtcccggcct ggagtacaat gtcagtgttt acactgtcaa
  61 ggatgacaag gaaagtgtcc ctatctctga taccatcatc ccaggtaata gaaaataagc
 121 tgctatcctg agagtgacat tccaataaga gtggggatta gcatcttaat ccccagatgc
 181 ttaagggtgt caactatatt tgggatttaa ttccgatctc ccagctgcac tttccaaaac
 241 caagaagtca aagcagcgat ttggacaaaa tgcttgctgt taacactgct ttactgtctg
 301 tgcttcactg ggatgctgtg tgttgcagcg agtatgtaat ggagtggcag ccatggcttt
 361 aactctgtat tgtctgctca catggaagta tgactaaaac actgtcacgt gtctgtactc
 421 agtactgata ggctcaaagt aatatggtaa atgcatccca tcagtacatt tctgcccgat
 481 tttacaatcc atatcaattt ccaacagctg cctatttcat cttgcagttt caaatccttc
 541 tttttgaaaa ttggatttta aaaaaaagtt aagtaaaagt cacaccttca gggttgttct
 601 ttcttgtggc cttgaaagac aacattgcaa aggcctgtcc taaggatagg cttgtttgtc
 661 cattgggtta taacataatg aaagcattgg acagatcgtg tccccctttg gactcttcag
 721 tagaatgctt ttactaacgc taattacatg ttttgattat gaatgaacct aaaatagtgg
 781 caatggcctt aacctaggcc tgtctttcct cagcctgaat gtgcttttga atggcacatt
 841 tcacaccata cattcataat gcattagcgt tatggccatg atgttgtcat gagttttgta
 901 tgggagaaaa aaaatcaatt tatcacccat ttattatttt ttccggttgt tcatgcaagc
 961 ttattttcta ctaaaacagt tttggaatta ttaaaagcat tgctgatact tacttcagat
1021 attatgtcta ggctctaaga atggtttcga catcctaaac agccatatga tttttaggaa
1081 tctgaacagt tcaaattgta ccctttaagg atgttttcaa aatgtaaaaa atatatatat
1141 atatatatat tccctaaaag aatattcctg tttattcttc tagggaagca aactgttcat
1201 gatgcttagg aagtcttttc agagaattta aaacagattg catattacca tcattgcttt
1261 aacattccac caattttact actagtaacc tgatatacac tgctttattt tttcctcttt
1321 ttttccctct attttccttt tgcctccccc tccctttgct ttgtaactca atagaggtgc
1381 cccaactcac tgacctaagc tttgttgata taaccgattc aagcatcggc ctgaggtgga
1441 ccccgctaaa ctcttccacc attattgggt accgcatcac agtagttgcg gcaggagaag
1501 gtatccctat ttttgaagat tttgtggact cctcagtagg atactacaca gtcacagggc
1561 tggagccggg cattgactat gatatcagcg ttatcactct cattaatggc ggcgagagtg
1621 cccctactac actgacacaa caaacgggtg aattttgaaa acttctgcgt ttgagacata
1681 gatggtgttg catgctgcca ccagttactc cggttaaata tggatgtttc atgggggaag
1741 tcagcaattg gccaaagatt cagataggtg gaattggggg gataaggaat caaatgcatc
1801 tgctaaactg attggagaaa aacacatgca atatcttcag tacactctca tttaaaccac
1861 aagtagatat aaagcctaga gaaatacaga tgtctgctct gttaaatata aaatagcaaa
1921 tgttcattca atttgaagac ctagaatttt tcttcttaaa taccaaacac gaataccaaa
1981 ttgcgtaagt accaattgat aagaatatat caccaaaatg taccatcatg ctcttccttc
2041 taccctttga taaactctac catgctcctt ctttgtagct aaaaacccat caaaatttag
2101 ggtagagtgg atgggcattg ttttgaggta ggagaaaagt aaacttggga ccattctagg
2161 ttttgttgct gtcactaggt aaagaaacac ctctttaacc acagtctggg gacaagcatg
2221 caacatttta aaggttctct gctgtgcatg ggaaaagaaa catgctgaga accaatttgc
2281 atgaacatgt tcacttgtaa gtagaattca ctgaatggaa ctgtagctct agatatctca
2341 catgggggga agtttaggac cctcttgtct ttttgtctgt gtgcatgtat ttctttgtaa
2401 agtactgcta tgtttctctt tgctgtgtgg caacttaagc ctcttcggcc tgggataaaa
2461 taatctgcag tggtattaat aatgtacata aagtcaacat atttgaaagt agattaaaat
2521 cttttttaaa tatatcaatg atggcaaaaa ggttaaaggg ggcctaacag tactgtgtgt
2581 agtgttttat ttttaacagt agtacactat aacttaaaat agacttagat tagactgttt
2641 gcatgattat gattctgttt cctttatgca tgaaatattg attttacctt tccagctact
2701 tcgttagctt taattttaaa atacattaap tgagtcttcc ttcttgttcg aaaccagctg
2761 ttcctcctcc cactgacctg cgattcacca acattggtcc agacaccatg cgtgtcacct
2821 ggg

// 
CEA

LOCUS       NM_004363 2974 bp mRNA linear PRI 28-NOV-2000
DEFINITION  Homo sapiens carcinoembryonic antigen-related cell adhesion
            molecule 5 (CEACAM5), mRNA.
ACCESSION   NM_004363
VERSION     NM_004363.1 GI:11386170
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

REFERENCE   1 (bases 1 to 2974)
  AUTHORS   Oikawa S, Nakazato H and Kosaki G.
  TITLE     Primary structure of human carcinoembryonic antigen (CEA) deduced
            from cDNA sequence
  JOURNAL   Biochem. Biophys. Res. Commun. 142 (2), 511-518 (1987)
  MEDLINE   87128144
  PUBMED    3814146
REFERENCE   2 (bases 1 to 2974)
  AUTHORS   Zimmermann W, Weber B, Ortlieb B, Rudert F, Schempp W, Fiebig HH,
            Shively JE, von Kleist S and Thompson JA.
  TITLE     Chromosomal localization of the carcinoembryonic antigen gene
            family and differential expression in various tumors
  JOURNAL   Cancer Res. 48 (9), 2550-2554 (1988)
  MEDLINE   88184584
  PUBMED    3356015
REFERENCE   3 (bases 1 to 2974)
  AUTHORS   Barnett,T., Goebel,S.J., Nothdurft,M.A. and Elting,J.J.
  TITLE     Carcinoembryonic antigen family: characterization of cDNAs coding
            for NCA and CEA and suggestion of nonrandom sequence variation in
            their conserved loop-domains
  JOURNAL   Genomics 3 (1), 59-66 (1988)
  MEDLINE   89122014
REFERENCE   4 (bases 1 to 2974)
  AUTHORS   Barnett T and Zimmermann W.
  TITLE     Workshop report: proposed nomenclature for the carcinoembryonic
            antigen (CEA) gene family
  JOURNAL   Tumour Biol. 11 (1-2), 59-63 (1990)
  MEDLINE   90176333
  PUBMED    2309067
REFERENCE   5 (bases 1 to 2974)
  AUTHORS   Schrewe H, Thompson J, Bona M, Hefta LJ, Maruya A, Hassauer M,
            Shively JE, von Kleist S and Zimmermann W.
  TITLE     Cloning of the complete gene for carcinoembryonic antigen: analysis
            of its promoter indicates a region conveying cell type-specific
            expression
  JOURNAL   Mol. Cell. Biol. 10 (6), 2738-2748 (1990)
  MEDLINE   90258861
  PUBMED    2342461
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from M29540.1.

FEATURES             Location/Qualifiers
     source          1..2974
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q13.1-q13.2"
     gene            1..2974
                     /gene="CEACAM5"
                     /note="CEA; CD66e"
                     /db_xref="LocusID:1048"
                     /db_xref="MIM:114890"
     CDS             115..2223
                     /gene="CEACAM5"
                     /note="carcinoembryonic antigen"
                     /codon_start=1
                     /db_xref="LocusID:1048"
                     /db_xref="MIM:114890"
                     /product="carcinoembryonic antigen-related cell adhesion
                     molecule 5"
                     /protein_id="NP_004354.1"
                     /db_xref="GI:11386171"
                     /translation="MESPSAPPHRWCIPWQRLLLTASLLTFWNPPTTAKLTIESTPFN
                     VAEGKEVLLLVHNLPQHLFGYSWYKGERWGNRQIIGYVIGTQQATPGPAYSGREIIY
                     PNASLLIQNIIQNDTGFYTLHVIKSDLVNEEATGQFRVYPELPKPSISSNNSKPVEDK
                     DAVAFTCEPETQDATYLWWVNNQSLPVSPRLQLSNGNRTLTLFNVTRNDTASYKCETQ
                     NPVSARRSDSVILNVLYGPDAPTISPLNTSYRSGENLNLSCHAASNPPAQYSWFVNGT
                     FQQSTQELFIPNITVNNSGSYTCQAHNSDTGLNRTTVTTITVYAEPPKPFITSNNSNP
                     VEDEDAVALTCEPEIQNTTYLWWVNNQSLPVSPRLQLSNDNRTLTLLSVTRNDVGPYE
                     CGIQNELSVDHSDPVILNVLYGPDDPTISPSYTYYRPGVNLSLSCHAASNPPAQYSWL
                     IDGNIQQHTQELFISNITEKNSGLYTCQANNSASGHSRTTVKTITVSAELPKPSISSN
                     NSKPVEDKDAVAFTCEPEAQOTTYLWWVNGQSLPVSPRLQLSNG^TLTLFWTRNDA
                     RAWCGIQNSVSANRSDPVTLDVLYGPDTPIISPPDSSYLSGANI^
                     YSWRINGIPQQHTQVLFIAKITPNNNGTYACFVSKTLATGRNN
                     LSAGATVGIMIGVLVGVALI"
     misc_feature    595..780
                     /note="IG; Region: Immunoglobulin"
     misc_feature    868..1026
                     /note="IG; Region: Immunoglobulin"
     misc_feature    1129..1314
                     /note="IG; Region: Immunoglobulin"
     misc_feature    1399..1566
                     /note="IG; Region: Immunoglobulin"
     misc_feature    1930..2091
                     /note="IG; Region: Immunoglobulin"
BASE COUNT  840 a 847 c 613 g 674 t

ORIGIN 
   1 ctcagggcag agggaggaag gacagcagac cagacagtca cagcagcctt gacaaaacgt
  61 tcctggaact caagctcttc tccacagagg aggacagagc agacagcaga gaccatggag
 121 tctccctcgg cccctcccca cagatggtgc atcccctggc agaggctcct gctcacagcc
 181 tcacttctaa ccttctggaa cccgcccacc actgccaagc tcactattga atccacgccg
 241 ttcaatgtcg cagaggggaa ggaggtgctt ctacttgtcc acaatctgcc ccagcatctt
 301 tttggctaca gctggtacaa aggtgaaaga gtggatggca accgtcaaat tataggatat
 361 gtaataggaa ctcaacaagc taccccaggg cccgcataca gtggtcgaga gataatatac
 421 cccaatgcat ccctgctgat ccagaacatc atccagaatg acacaggatt ctacacccta
 481 cacgtcataa agtcagatct tgtgaatgaa gaagcaactg gccagttccg ggtatacccg
 541 gagctgccca agccctccat ctccagcaac aactccaaac ccgtggagga caaggatgct
 601 gtggccttca cctgtgaacc tgagactcag gacgcaacct acctgtggtg ggtaaacaat
 661 cagagcctcc cggtcagtcc caggctgcag ctgtccaatg gcaacaggac cctcactcta
 721 ttcaatgtca caagaaatga cacagcaagc tacaaatgtg aaacccagaa cccagtgagt
 781 gccaggcgca gtgattcagt catcctgaat gtcctctatg gcccggatgc ccccaccatt
 841 tcccctctaa acacatctta cagatcaggg gaaaatctga acctctcctg ccacgcagcc
 901 tctaacccac ctgcacagta ctcttggttt gtcaatggga ctttccagca atccacccaa
 961 gagctcttta tccccaacat cactgtgaat aatagtggat cctatacgtg ccaagcccat
1021 aactcagaca ctggcctcaa taggaccaca gtcacgacga tcacagtcta tgcagagcca
1081 cccaaaccct tcatcaccag caacaactcc aaccccgtgg aggatgagga tgctgtagcc
1141 ttaacctgtg aacctgagat tcagaacaca acctacctgt ggtgggtaaa taatcagagc
1201 ctcccggtca gtcccaggct gcagctgtcc aatgacaaca ggaccctcac tctactcagt
1261 gtcacaagga atgatgtagg accctatgag tgtggaatcc agaacgaatt aagtgttgac
1321 cacagcgacc cagtcatcct gaatgtcctc tatggcccag acgaccccac catttccccc
1381 tcatacacct attaccgtcc aggggtgaac ctcagcctct cctgccatgc agcctctaac
1441 ccacctgcac agtattcttg gctgattgat gggaacatcc agcaacacac acaagagctc
1501 tttatctcca acatcactga gaagaacagc ggactctata cctgccaggc caataactca
1561 gccagtggcc acagcaggac tacagtcaag acaatcacag tctctgcgga gctgcccaag
1621 ccctccatct ccagcaacaa ctccaaaccc gtggaggaca aggatgctgt ggccttcacc
1681 tgtgaacctg aggctcagaa cacaacctac ctgtggtggg taaatggtca gagcctccca
1741 gtcagtccca ggctgcagct gtccaatggc aacaggaccc tcactctatt caatgtcaca
1801 agaaatgacg caagagccta tgtatgtgga atccagaact cagtgagtgc aaaccgcagt
1861 gacccagtca ccctggatgt cctctatggg ccggacaccc ccatcatttc ccccccagac
1921 tcgtcttacc tttcgggagc gaacctcaac ctctcctgcc actcggcctc taacccatcc
1981 ccgcagtatt cttggcgtat caatgggata ccgcagcaac acacacaagt tctctttatc
2041 gccaaaatca cgccaaataa taacgggacc tatgcctgtt ttgtctctaa cttggctact
2101 ggccgcaata attccatagt caagagcatc acagtctctg catctggaac ttctcctggt
2161 ctctcagctg gggccactgt cggcatcatg attggagtgc tggttggggt tgctctgata
2221 tagcagccct ggtgtagttt cttcatttca ggaagactga cagttgtttt gcttcttcct
2281 taaagcattt gcaacagcta cagtctaaaa ttgcttcttt accaaggata tttacagaaa
2341 agactctgac cagagatcga gaccatccta gccaacatcg tgaaacccca tctctactaa
2401 aaatacaaaa atgagctggg cttggtggcg cgcacctgta gtcccagtta ctcgggaggc
2461 tgaggcagga gaatcgcttg aacccgggag gtggagattg cagtgagccc agatcgcacc
2521 actgcactcc agtctggcaa cagagcaaga ctccatctca aaaagaaaag aaaagaagac
2581 tctgacctgt actcttgaat acaagtttct gataccactg cactgtctga gaatttccaa
2641 aactttaatg aactaactga cagcttcatg aaactgtcca ccaagatcaa gcagagaaaa
2701 taattaattt catgggacta aatgaactaa tgaggattgc tgattcttta aatgtcttgt
2761 ttcccagatt tcaggaaact ttttttcttt taagctatcc actcttacag caatttgata
2821 aaatatactt ttgtgaacaa aaattgagac atttacattt tctccctatg tggtcgctcc
2881 agacttggga aactattcat gaatatttat attgtatggt aatatagtta ttgcacaagt
2941 tcaataaaaa tctgctcttt gtataacaga aaaa

//

Her2/Neu

LOCUS       HUMHER2A 4530 bp mRNA linear PRI 18-SEP-1995
DEFINITION  Human tyrosine kinase-type receptor (HER2) mRNA, complete cds.
ACCESSION   M11730
VERSION     M11730.1 GI:183986
KEYWORDS    tyrosine kinase.
SOURCE      Homo sapiens (clone: lambda-HER2-436) fetal cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

REFERENCE   1 (bases 1 to 4530)
  AUTHORS   Coussens,L., Yang-Feng,T.L., Liao,Y.-C., Chen,B., Gray,A.,
            McGrath,J., Seeburg,P.H., Libermann,T.A., Schlessinger,J.,
            Francke,U., Levinson,A. and Ullrich,A.
  TITLE     Tyrosine kinase receptor with extensive homology to EGF receptor
            shares chromosomal location with neu oncogene
  JOURNAL   Science 230 (4730), 1132-1139 (1985)
  MEDLINE   86070181
REFERENCE   2 (bases 1701 to 1719)
  AUTHORS   Ullrich,A.
  JOURNAL   Unpublished (1988)
FEATURES             Location/Qualifiers
     source          1..4530
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="lambda-HER2-436"
                     /dev_stage="fetal"
     mRNA            <1..4530
                     /product="HER2 mRNA"
     CDS             151..3918
                     /note="HER2 receptor"
                     /codon_start=1
                     /protein_id="AAA75493.1"
                     /db_xref="GI:306840"
                     /translation="MELAALCRWGLLLALLPPGAASTQVCTGTDMKLRLPASPETHLD
                     MLRHLYQGCQWQGNLELTYLPTNASLSFLQDIQEVQGYVLIAHNQVRQVPLQRLRIV
                     RGTQLFEDNYALAVLDNGDPLNNTTPVTGASPGGLRELQLRSLTEILKGGVLIQRNPQ
                     LCYQDTILWKDIFHKNNQLALTLIDTNRSRACHPCSPMCKGSRCWGESSEDCQSLTRT
                     VCAGGC^CKGPLPTDCCHEQCAAGCTGPKHSDCLACLHFNHSGICELHCPALVTYNT
                     DTFESMPNPEGRYTFGASCVTACPYNYLSTDVGSCTLVCPLHNQEVTAEDGTQRCEKC
                     SKPCARVCYGLGMEHLREVRAVTSANIQEFAGCKKIFGSLAFLPESFDGDPASNTAPL
                     QPEQLQVFETLEEITGYLYISAWPDSLPDLSVFQNLQVIRGRILHNGAYSLTLQGLGI
                     SWLGLRSLRELGSGLALIHHNTHLCFVHTVPWDQLFRNPHQALLHTANRPEDECVGEG
                     IACHQLCARGHCWGPGPTQCVNCSQFLRGQECVEECRVL
                     CQPQNGSVTCFGPEADQC^ACMrm)PPFCVARCPS
                     PCPINCTHSCVDLDDKGCPAEQRASPLTSIVSAWGILLVWLGWFGILIKRRQQKI
                     RKYT^^rcLLQETELVEPLTPSGAMPNQAQMRILKET^
                     PDGENVKIPVAIKVLRENTSPKANKEILDEAYVMAGVGSPYVSRLLGICLTSTVQLVT
                     QLMPYGCLLDHVRENRGRLGSQDLLNWCMQIAK^^
                     NHVKITDFGLARLLDIDETEYHADGGKVPIKW^
                     LMTFGAKPYDGIPAREIPDLLEKGERLPQPPICTIDVYMIMVKCWMIDSECRPRFREL
                     VSEFSRMARDPQRFWIQNEDLGPASPLDSTFYRSLLEDDDMGDLVDAEEYLVPQQGF
                     FCPDPAPGAGGMVHHRHRSSSTRSGGGDLTLGLBPSEEEAPRSPLAPSBGAGSDVFDG
                     DLGMGAAKGLQSLPTHDPSPLQRYSEDPTVPLPSETDGYVAPLTCSPQPEYVNQPDVR
                     PQPPSPREGPLPAARPAGATLERAKTLSPGKNGWKDVFAFGGAVENPEYLTPQGGAA
                     PQPHPPPAFSPAFDNLYYWDQDPPERGAPPSTFKGTPTAENPEYLGLDVPV"
     old_sequence    1701..1719
                     /citation=[1]
BASE COUNT  922 a 1382 c 1346 g 880 t

ORIGIN - Chromosome 17q21-q22. 

        1 aattctcgag ctcgtcgacc ggtcgacgag ctcgagggtc gacgagctcg agggcgcgcg
       61 cccggccccc acccctcgca gcaccccgcg ccccgcgccc tcccagccgg gtccagccgg
      121 agccatgggg ccggagccgc agtgagcacc atggagctgg cggccttgtg ccgctggggg
      181 ctcctcctcg ccctcttgcc ccccggagcc gcgagcaccc aagtgtgcac cggcacagac
      241 atgaagctgc ggctccctgc cagtcccgag acccacctgg acatgctccg ccacctctac
      301 cagggctgcc aggtggtgca gggaaacctg gaactcacct acctgcccac caatgccagc
      361 ctgtccttcc tgcaggatat ccaggaggtg cagggctacg tgctcatcgc tcacaaccaa
      421 gtgaggcagg tcccactgca gaggctgcgg attgtgcgag gcacccagct ctttgaggac
      481 aactatgccc tggccgtgct agacaatgga gacccgctga acaataccac ccctgtcaca
      541 ggggcctccc caggaggcct gcgggagctg cagcttcgaa gcctcacaga gatcttgaaa
      601 ggaggggtct tgatccagcg gaacccccag ctctgctacc aggacacgat tttgtggaag






      661 gacatcttcc acaagaacaa ccagctggct ctcacactga tagacaccaa ccgctctcgg
      721 gcctgccacc cctgttctcc gatgtgtaag ggctcccgct gctggggaga gagttctgag
      781 gattgtcaga gcctgacgcg cactgtctgt gccggtggct gtgcccgctg caaggggcca
      841 ctgcccactg actgctgcca tgagcagtgt gctgccggct gcacgggccc caagcactct
      901 gactgcctgg cctgcctcca cttcaaccac agtggcatct gtgagctgca ctgcccagcc
      961 ctggtcacct acaacacaga cacgtttgag tccatgccca atcccgaggg ccggtataca
     1021 ttcggcgcca gctgtgtgac tgcctgtccc tacaactacc tttctacgga cgtgggatcc
     1081 tgcaccctcg tctgccccct gcacaaccaa gaggtgacag cagaggatgg aacacagcgg
     1141 tgtgagaagt gcagcaagcc ctgtgcccga gtgtgctatg gtctgggcat ggagcacttg
     1201 cgagaggtga gggcagttac cagtgccaat atccaggagt ttgctggctg caagaagatc
     1261 tttgggagcc tggcatttct gccggagagc tttgatgggg acccagcctc caacactgcc
     1321 ccgctccagc cagagcagct ccaagtgttt gagactctgg aagagatcac aggttaccta
     1381 tacatctcag catggccgga cagcctgcct gacctcagcg tcttccagaa cctgcaagta
     1441 atccggggac gaattctgca caatggcgcc tactcgctga ccctgcaagg gctgggcatc
     1501 agctggctgg ggctgcgctc actgagggaa ctgggcagtg gactggccct catccaccat
     1561 aacacccacc tctgcttcgt gcacacggtg ccctgggacc agctctttcg gaacccgcac
     1621 caagctctgc tccacactgc caaccggcca gaggacgagt gtgtgggcga gggcctggcc
     1681 tgccaccagc tgtgcgcccg agggcactgc tggggtccag ggcccaccca gtgtgtcaac
     1741 tgcagccagt tccttcgggg ccaggagtgc gtggaggaat gccgagtact gcaggggctc
     1801 cccagggagt atgtgaatgc caggcactgt ttgccgtgcc accctgagtg tcagccccag
     1861 aatggctcag tgacctgttt tggaccggag gctgaccagt gtgtggcctg tgcccactat
     1921 aaggaccctc ccttctgcgt ggcccgctgc cccagcggtg tgaaacctga cctctcctac
     1981 atgcccatct ggaagtttcc agatgaggag ggcgcatgcc agccttgccc catcaactgc
     2041 acccactcct gtgtggacct ggatgacaag ggctgccccg ccgagcagag agccagccct
     2101 ctgacgtcca tcgtctctgc ggtggttggc attctgctgg tcgtggtctt gggggtggtc
     2161 tttgggatcc tcatcaagcg acggcagcag aagatccgga agtacacgat gcggagactg
     2221 ctgcaggaaa cggagctggt ggagccgctg acacctagcg gagcgatgcc caaccaggcg
     2281 cagatgcgga tcctgaaaga gacggagctg aggaaggtga aggtgcttgg atctggcgct
     2341 tttggcacag tctacaaggg catctggatc cctgatgggg agaatgtgaa aattccagtg
     2401 gccatcaaag tgttgaggga aaacacatcc cccaaagcca acaaagaaat cttagacgaa
     2461 gcatacgtga tggctggtgt gggctcccca tatgtctccc gccttctggg catctgcctg
     2521 acatccacgg tgcagctggt gacacagctt atgccctatg gctgcctctt agaccatgtc
     2581 cgggaaaacc gcggacgcct gggctcccag gacctgctga actggtgtat gcagattgcc
     2641 aaggggatga gctacctgga ggatgtgcgg ctcgtacaca gggacttggc cgctcggaac
     2701 gtgctggtca agagtcccaa ccatgtcaaa attacagact tcgggctggc tcggctgctg
     2761 gacattgacg agacagagta ccatgcagat gggggcaagg tgcccatcaa gtggatggcg
     2821 ctggagtcca ttctccgccg gcggttcacc caccagagtg atgtgtggag ttatggtgtg
     2881 actgtgtggg agctgatgac ttttggggcc aaaccttacg atgggatccc agcccgggag
     2941 atccctgacc tgctggaaaa gggggagcgg ctgccccagc cccccatctg caccattgat
     3001 gtctacatga tcatggtcaa atgttggatg attgactctg aatgtcggcc aagattccgg
     3061 gagttggtgt ctgaattctc ccgcatggcc agggaccccc agcgctttgt ggtcatccag
     3121 aatgaggact tgggcccagc cagtcccttg gacagcacct tctaccgctc actgctggag
     3181 gacgatgaca tgggggacct ggtggatgct gaggagtatc tggtacccca gcagggcttc
     3241 ttctgtccag accctgcccc gggcgctggg ggcatggtcc accacaggca ccgcagctca
     3301 tctaccagga gtggcggtgg ggacctgaca ctagggctgg agccctctga agaggaggcc
     3361 cccaggtctc cactggcacc ctccgaaggg gctggctccg atgtatttga tggtgacctg
     3421 ggaatggggg cagccaaggg gctgcaaagc ctccccacac atgaccccag ccctctacag
     3481 cggtacagtg aggaccccac agtacccctg ccctctgaga ctgatggcta cgttgccccc
     3541 ctgacctgca gcccccagcc tgaatatgtg aaccagccag atgttcggcc ccagccccct
     3601 tcgccccgag agggccctct gcctgctgcc cgacctgctg gtgccactct ggaaagggcc
     3661 aagactctct ccccagggaa gaatggggtc gtcaaagacg tttttgcctt tgggggtgcc
     3721 gtggagaacc ccgagtactt gacaccccag ggaggagctg cccctcagcc ccaccctcct
     3781 cctgccttca gcccagcctt cgacaacctc tattactggg accaggaccc accagagcgg
     3841 ggggctccac ccagcacctt caaagggaca cctacggcag agaacccaga gtacctgggt
     3901 ctggacgtgc cagtgtgaac cagaaggcca agtccgcaga agccctgatg tgtcctcagg
     3961 gagcagggaa ggcctgactt ctgctggcat caagaggtgg gagggccctc cgaccacttc
     4021 caggggaacc tgccatgcca ggaacctgtc ctaaggaacc ttccttcctg cttgagttcc
     4081 cagatggctg gaaggggtcc agcctcgttg gaagaggaac agcactgggg agtctttgtg
     4141 gattctgagg ccctgcccaa tgagactcta gggtccagtg gatgccacag cccagcttgg
     4201 ccctttcctt ccagatcctg ggtactgaaa gccttaggga agctggcctg agaggggaag
     4261 cggccctaag ggagtgtcta agaacaaaag cgacccattc agagactgtc cctgaaacct
     4321 agtactgccc cccatgagga aggaacagca atggtgtcag tatccaggct ttgtacagag
     4381 tgcttttctg tttagttttt actttttttg ttttgttttt ttaaagacga aataaagacc
     4441 caggggagaa tgggtgttgt atggggaggc aagtgtgggg ggtccttctc cacacccact
     4501 ttgtccattt gcaaatatat tttggaaaac

// 
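A record's BASE COUNT line can be cross-checked against its declared sequence length. The following is a minimal Python sketch, assuming the four counts are copied by hand from the HER2 record above (922 a, 1382 c, 1346 g, 880 t) and the length is taken from its mRNA feature (<1..4530). The function name `base_count_total` is hypothetical and is not part of any sequence-analysis library.

```python
def base_count_total(counts):
    """Sum the per-base counts taken from a GenBank BASE COUNT line."""
    return sum(counts.values())


# Values copied by hand from the HER2 record (assumed transcribed correctly).
her2_counts = {"a": 922, "c": 1382, "g": 1346, "t": 880}
her2_declared_length = 4530  # upper bound of the mRNA feature <1..4530

total = base_count_total(her2_counts)
# True when the BASE COUNT line and the feature coordinates agree.
print(total == her2_declared_length)
```

The same check applies to the other records in this listing by substituting their BASE COUNT values and declared lengths.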



SCP-1 

LOCUS       HSSCP1PRT    3393 bp    mRNA    linear   PRI 26-FEB-1997
DEFINITION  H. sapiens mRNA for SCP1 protein.
ACCESSION   X95654
VERSION     X95654.1  GI:1212982
KEYWORDS    SCP1 gene.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3393)
  AUTHORS   Meuwissen,R.L., Meerts,I., Hoovers,J.M., Leschot,N.J. and
            Heyting,C.
  TITLE     Human synaptonemal complex protein 1 (SCP1); isolation and
            characterization of the cDNA and chromosomal localization of the
            gene
  JOURNAL   Genomics 39 (3), 377-384 (1997)
  MEDLINE   97224467
REFERENCE   2  (bases 1 to 3393)
  AUTHORS   Meuwissen,R.J.L.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-FEB-1996) Dr. R.L.J. Meuwissen, Agricultural
            University, Genetics, Dreijenlaan 2, 6703 HA WAGENINGEN,
            NETHERLANDS
FEATURES             Location/Qualifiers
     source          1..3393
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1p12-p13"
                     /tissue_type="testis"
     gene            95..3025
                     /gene="SCP1"
     CDS             95..3025
                     /gene="SCP1"
                     /codon_start=1
                     /evidence=experimental
                     /product="polypeptide of 976 aa"
                     /protein_id="CAA64956.1"
                     /db_xref="GI:1212983"
                     /db_xref="SWISS-PROT:Q15431"

                     /translation="MEKQKPFALFVPPRSSSSQVSAVKPQTLGGDSTFFKSFNKCTED
                     DLEFPFAKTNLSKNGENIDSDPALQKVNFLPVLEQVGNSDCHYQEGLKDSDLENSEGL
                     SRVFSKLYKEAEKIKKWKVSTEAELRQKESKLQENRKIIEAQRKAIQELQFGNEKVSL
                     KLEEGIQENKDLIKENNATRHLCNLLKETCARSAEKTKKYEYEREETRQVYMDLNNNI
                     EKMITAHGELRVQAENSRLEMHFKLKEDYEKIQHLEQEYKKEINDKEKQVSLLLIQIT
                     EKENKMKDLTFLLEESRDKVNQLEEKTKLQSENLKQSIEKQHHLTKELEDIKVSLQRS
                     VSTQKALEEDLQIATKTICQLTEEKETQMEESNKARAAHSFVVTEFETTVCSLEELLR
                     TEQQRLEKNEDQLKILTMELQKKSSELEEMTKLTNNKEVELEELKKVLGEKETLLYEN
                     KQFEKIAEELKGTEQELIGLLQAREKEVHDLEIQLTAITTSEQYYSKEVKDLKTELEN
                     EKLKNTELTSHCNKLSLENKELTQETSDMTLELKNQQEDINNNKKQEERMLKQIENLQ
                     ETETQLRNELEYVREELKQKRDEVKCKLDKSEENCNNLRKQVENKNKYIEELQQENKA
                     LKKKGTAESKQLNVYEIKVNKLELELESAKQKFGEITDTYQKEIEDKKISEENLLEEV
                     EKAKVIADEAVKLQKEIDKRCQHKIAEMVALMEKHKHQYDKIIEERDSELGLYKSKEQ
                     EQSSLRASLEIELSNLKAELLSVKKQLEIEREEKEKLKREAKENTATLKEKKDKKTQT
                     FLLETPEIYWKLDSKAVPSQTVSRNFTSVDHGISKDKRDYLWTSAKNTLSTPLPKAYT
                     VKTPTKPKLQQRENLNIPIEESKKKRKMAFEFDINSDSSETTDLLSMVSEEETLKTLY
                     RNNNPPASHLCVKTPKKAPSSLTTPGPTLKFGAIRKMREDRWAVIAKMDRKKKLKEAE
                     KLFV"

BASE COUNT 1440 a 483 c 648 g 822 t 

ORIGIN 

        1 gccctcatag accgtttgtt gtagttcgcg tgggaacagc aacccacggt ttcccgatag
       61 ttcttcaaag atatttacaa ccgtaacaga gaaaatggaa aagcaaaagc cctttgcatt
      121 gttcgtacca ccgagatcaa gcagcagtca ggtgtctgcg gtgaaacctc agaccctggg
      181 aggcgattcc actttcttca agagtttcaa caaatgtact gaagatgatt tggagtttcc
      241 atttgcaaag actaatctct ccaaaaatgg ggaaaacatt gattcagatc ctgctttaca
      301 aaaagttaat ttcttgcccg tgcttgagca ggttggtaat tctgactgtc actatcagga
      361 aggactaaaa gactctgatt tggagaattc agagggattg agcagagtgt tttcaaaact
      421 gtataaggag gctgaaaaga taaaaaaatg gaaagtaagt acagaagctg aactgagaca
      481 gaaagaaagt aagttgcaag aaaacagaaa gataattgaa gcacagcgaa aagccattca
      541 ggaactgcaa tttggaaatg aaaaagtaag tttgaaatta gaagaaggaa tacaagaaaa
      601 taaagattta ataaaagaga ataatgccac aaggcattta tgtaatctac tcaaagaaac
      661 ctgtgctaga tctgcagaaa agacaaagaa atatgaatat gaacgggaag aaaccaggca
      721 agtttatatg gatctaaata ataacattga gaaaatgata acagctcatg gggaacttcg
      781 tgtgcaagct gagaattcca gactggaaat gcattttaag ttaaaggaag attatgaaaa
      841 aatccaacac cttgaacaag aatacaagaa ggaaataaat gacaaggaaa agcaggtatc
      901 actactattg atccaaatca ctgagaaaga aaataaaatg aaagatttaa catttctgct
      961 agaggaatcc agagataaag ttaatcaatt agaggaaaag acaaaattac agagtgaaaa
     1021 cttaaaacaa tcaattgaga aacagcatca tttgactaaa gaactagaag atattaaagt
     1081 gtcattacaa agaagtgtga gtactcaaaa ggctttagag gaagatttac agatagcaac
     1141 aaaaacaatt tgtcagctaa ctgaagaaaa agaaactcaa atggaagaat ctaataaagc
     1201 tagagctgct cattcgtttg tggttactga atttgaaact actgtctgca gcttggaaga
     1261 attattgaga acagaacagc aaagattgga aaaaaatgaa gatcaattga aaatacttac
     1321 catggagctt caaaagaaat caagtgagct ggaagagatg actaagctta caaataacaa
     1381 agaagtagaa cttgaagaat tgaaaaaagt cttgggagaa aaggaaacac ttttatatga
     1441 aaataaacaa tttgagaaga ttgctgaaga attaaaagga acagaacaag aactaattgg
     1501 tcttctccaa gccagagaga aagaagtaca tgatttggaa atacagttaa ctgccattac
     1561 cacaagtgaa cagtattatt caaaagaggt taaagatcta aaaactgagc ttgaaaacga
     1621 gaagcttaag aatactgaat taacttcaca ctgcaacaag ctttcactag aaaacaaaga
     1681 gctcacacag gaaacaagtg atatgaccct agaactcaag aatcagcaag aagatattaa
     1741 taataacaaa aagcaagaag aaaggatgtt gaaacaaata gaaaatcttc aagaaacaga
     1801 aacccaatta agaaatgaac tagaatatgt gagagaagag ctaaaacaga aaagagatga
     1861 agttaaatgt aaattggaca agagtgaaga aaattgtaac aatttaagga aacaagttga
     1921 aaataaaaac aagtatattg aagaacttca gcaggagaat aaggccttga aaaaaaaagg
     1981 tacagcagaa agcaagcaac tgaatgttta tgagataaag gtcaataaat tagagttaga
     2041 actagaaagt gccaaacaga aatttggaga aatcacagac acctatcaga aagaaattga
     2101 ggacaaaaag atatcagaag aaaatctttt ggaagaggtt gagaaagcaa aagtaatagc
     2161 tgatgaagca gtaaaattac agaaagaaat tgataagcga tgtcaacata aaatagctga
     2221 aatggtagca cttatggaaa aacataagca ccaatatgat aagatcattg aagaaagaga
     2281 ctcagaatta ggactttata agagcaaaga acaagaacag tcatcactga gagcatcttt
     2341 ggagattgaa ctatccaatc tcaaagctga acttttgtct gttaagaagc aacttgaaat
     2401 agaaagagaa gagaaggaaa aactcaaaag agaggcaaaa gaaaacacag ctactcttaa
     2461 agaaaaaaaa gacaagaaaa cacaaacatt tttattggaa acacctgaaa tttattggaa
     2521 attggattct aaagcagttc cttcacaaac tgtatctcga aatttcacat cagttgatca
     2581 tggcatatcc aaagataaaa gagactatct gtggacatct gccaaaaata ctttatctac
     2641 accattgcca aaggcatata cagtgaagac accaacaaaa ccaaaactac agcaaagaga
     2701 aaacttgaat atacccattg aagaaagtaa aaaaaagaga aaaatggcct ttgaatttga
     2761 tattaattca gatagttcag aaactactga tcttttgagc atggtttcag aagaagagac
     2821 attgaaaaca ctgtatagga acaataatcc accagcttct catctttgtg tcaaaacacc
     2881 aaaaaaggcc ccttcatctc taacaacccc tggacctaca ctgaagtttg gagctataag
     2941 aaaaatgcgg gaggaccgtt gggctgtaat tgctaaaatg gatagaaaaa aaaaactaaa
     3001 agaagctgaa aagttatttg tttaatttca gagaatcagt gtagttaagg agcctaataa
     3061 cgtgaaactt atagttaata ttttgttctt atttgccaga gccacatttt atctggaagt
     3121 fcgagacttaa aaaatacttg catgaatgat ttgtgtttct ttatattttt agcctaaatg
     3181 ttaactacat attgtctgga aacctgtcat tgtattcaga taattagatg attatatatt
     3241 gttgttactt tttcttgtat tcatgaaaac tgtttttact aagttttcaa atttgtaaag
     3301 ttagcctttg aatgctagga atgcattatt gagggtcatt ctttattctt tactattaaa
     3361 atattttgga tgcaaaaaaa aaaaaaaaaa aaa

// 
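The CDS coordinates of a record fix the length of the encoded polypeptide, which gives a quick consistency check on the feature table. The sketch below is illustrative only: it assumes a complete, in-frame CDS (codon_start=1) that ends in a single stop codon, and the function name `protein_length_from_cds` is hypothetical. The coordinates 95..3025 are copied from the SCP1 record above.

```python
def protein_length_from_cds(start, end):
    """Number of residues encoded by a CDS spanning start..end.

    Coordinates are 1-based and inclusive, as in a GenBank feature table.
    Assumes the CDS is in frame and terminates with exactly one stop codon.
    """
    n_bases = end - start + 1
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n_bases // 3 - 1  # the final codon is the stop codon


# SCP1 CDS coordinates from the record above.
print(protein_length_from_cds(95, 3025))
```

For the SCP1 coordinates the result can be compared with the /product qualifier, which states a polypeptide of 976 aa.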



SSX-4

LOCUS       NM_005636    576 bp    mRNA    linear   PRI 10-DEC-2001
DEFINITION  Homo sapiens synovial sarcoma, X breakpoint 4 (SSX4), mRNA.
ACCESSION   NM_005636
VERSION     NM_005636.1  GI:5032122
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 576)
  AUTHORS   Gure,A.O., Tureci,O., Sahin,U., Tsang,S., Scanlan,M.J., Knuth,A.,
            Pfreundschuh,M., Old,L.J. and Chen,Y.T.
  TITLE     SSX: a multigene family with several members transcribed in normal
            testis and human cancer
  JOURNAL   Int. J. Cancer 72 (6), 965-971 (1997)
  MEDLINE   98021352
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from U90841.1.

FEATURES             Location/Qualifiers
     source          1..576
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="X"
                     /map="Xp11.3"
     gene            1..576
                     /gene="SSX4"
                     /db_xref="LocusID:6759"
                     /db_xref="MIM:300326"
     CDS             1..567
                     /gene="SSX4"
                     /note="Kruppel-associated box containing SSX gene"
                     /codon_start=1
                     /db_xref="LocusID:6759"
                     /db_xref="MIM:300326"
                     /product="synovial sarcoma, X breakpoint 4"
                     /protein_id="NP_005627.1"
                     /db_xref="GI:5032123"
                     /translation="MNGDDAFARRPRDDAQISEKLRKAFDDIAKYFSKKEWEKMKSSEKIVY
                     VYMKLNYEVMTKLGFKVTLPPFMRSKRAADFHGNDFGNDRNHRNQVERPQMTFG
                     SLQRIFPKIMPKKPAEEENGLKEVPEASGPQNDGKQLCPPGNPSTLEKINKTSGPKRG
                     KHAWTHRLRERKQLVVYEEISDPEEDDE"
     misc_feature    70..246
                     /note="KRAB; Region: krueppel associated box"
BASE COUNT      187 a    127 c    150 g    112 t
ORIGIN
        1 atgaacggag acgacgcctt tgcaaggaga cccagggatg atgctcaaat atcagagaag
       61 ttacgaaagg ccttcgatga tattgccaaa tacttctcta agaaagagtg ggaaaagatg
      121 aaatcctcgg agaaaatcgt ctatgtgtat atgaagctaa actatgaggt catgactaaa
      181 ctaggtttca aggtcaccct cccacctttc atgcgtagta aacgggctgc agacttccac
      241 gggaatgatt ttggtaacga tcgaaaccac aggaatcagg ttgaacgtcc tcagatgact
      301 ttcggcagcc tccagagaat cttcccgaag atcatgccca agaagccagc agaggaagaa
      361 aatggtttga aggaagtgcc agaggcatct ggcccacaaa atgatgggaa acagctgtgc
      421 cccccgggaa atccaagtac cttggagaag attaacaaga catctggacc caaaaggggg
      481 aaacatgcct ggacccacag actgcgtgag agaaagcagc tggtggttta tgaagagatc
      541 agcgaccctg aggaagatga cgagtaactc ccctcg
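The /translation qualifier of a record can be checked against its nucleotide sequence by translating the CDS with the standard genetic code. The following Python sketch is illustrative: the table is the standard code written in TCAG order, the function name `translate` is hypothetical, and the input is the first ORIGIN line of the SSX-4 record, whose CDS begins at position 1.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(nucleotides):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (as found in a GenBank ORIGIN block) is ignored.
    """
    seq = "".join(nucleotides.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ORIGIN line of the SSX-4 record (positions 1 to 60).
first_row = ("atgaacggag acgacgcctt tgcaaggaga "
             "cccagggatg atgctcaaat atcagagaag")
print(translate(first_row))
```

The printed residues can be compared with the start of the /translation qualifier in the same record.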

All patents and publications mentioned in the specification
are indicative of the levels of those skilled in the art to which
the invention pertains. All patents and publications are herein
incorporated by reference to the same extent as if each individual
publication was specifically and individually indicated to be
incorporated by reference.

The invention illustratively described herein suitably may be
practiced in the absence of any element or elements, limitation or
limitations which is not specifically disclosed herein. The terms
and expressions which have been employed are used as terms of
description and not of limitation, and there is no intention that
the use of such terms and expressions indicates the exclusion
of equivalents of the features shown and described or portions
thereof. It is recognized that various modifications are possible
within the scope of the invention claimed. Thus, it should be
understood that although the present invention has been
specifically disclosed by preferred embodiments and optional
features, modification and variation of the concepts herein
disclosed may be resorted to by those skilled in the art, and that
such modifications and variations are considered to be within the
scope of this invention as defined by the appended claims.

1. An isolated epitope, comprising a component selected from the group consisting of:

(i) a polypeptide having the sequence as disclosed in TABLE 1;

(ii) an epitope cluster comprising the polypeptide of (i);

(iii) a polypeptide having substantial similarity to (i) or (ii);

(iv) a polypeptide having functional similarity to any of (i) through (iii); and

(v) a nucleic acid encoding the polypeptide of any of (i) through (iv).

2. The epitope of claim 1, wherein the epitope is immunologically active.

3. The epitope of claim 1, wherein the polypeptide is less than about 30 amino acids in length.

4. The epitope of claim 1, wherein the polypeptide is 8 to 10 amino acids in length.

5. The epitope of claim 1, wherein the substantial or functional similarity comprises addition of at least one amino acid.

6. The epitope of claim 5, wherein the at least one additional amino acid is at an N-terminus of the polypeptide.

7. The epitope of claim 1, wherein the substantial or functional similarity comprises a substitution of at least one amino acid.

8. The epitope of claim 1, the polypeptide having affinity to an HLA-A2 molecule.

9. The epitope of claim 8, wherein the affinity is determined by an assay of binding.

10. The epitope of claim 8, wherein the affinity is determined by an assay of restriction of epitope recognition.

11. The epitope of claim 8, wherein the affinity is determined by a prediction algorithm.

12. The epitope of claim 1, the polypeptide having affinity to an HLA-B7 or HLA-B51 molecule.

13. The epitope of claim 1, wherein the polypeptide is a housekeeping epitope.

14. The epitope of claim 1, wherein the polypeptide corresponds to an epitope displayed on a tumor cell.

15. The epitope of claim 1, wherein the polypeptide corresponds to an epitope displayed on a neovasculature cell.

16. The epitope of claim 1, wherein the peptide is an immune epitope.

17. The epitope of claim 1, wherein the epitope is a nucleic acid.

18. A pharmaceutical composition comprising the peptide of claim 1 and a pharmaceutically acceptable adjuvant, carrier, diluent, or excipient.

19. The composition of claim 18, where the adjuvant is a polynucleotide.

20. The composition of claim 19 wherein the polynucleotide comprises a dinucleotide.

21. The composition of claim 20 wherein the dinucleotide is CpG.

22. The composition of claim 18, wherein the adjuvant is encoded by a polynucleotide.

23. The composition of claim 18 wherein the adjuvant is a cytokine.

24. The composition of claim 23 wherein the cytokine is GM-CSF.

25. The composition of claim 18 further comprising a professional antigen-presenting cell (pAPC).

26. The composition of claim 25, wherein the pAPC is a dendritic cell.

27. The composition of claim 18, further comprising a second epitope.

28. The composition of claim 27, wherein the second epitope is a polypeptide.

29. The composition of claim 27, wherein the second epitope is a nucleic acid.

30. The composition of claim 27, wherein the second epitope is a housekeeping epitope.

31. The composition of claim 27, wherein the second epitope is an immune epitope.

32. A pharmaceutical composition comprising the nucleic acid of claim 1 and a pharmaceutically acceptable adjuvant, carrier, diluent, or excipient.

33. A recombinant construct comprising the nucleic acid of claim 1.

34. The construct of claim 33, further comprising a plasmid, a viral vector, or an artificial chromosome.

35. The construct of claim 33, further comprising a sequence encoding at least one feature selected from the group consisting of a second epitope, an IRES, an ISS, an NIS, and ubiquitin.

36. A purified antibody that specifically binds to the epitope of claim 1.

37. A purified antibody that specifically binds to a peptide-MHC protein complex comprising the epitope of claim 1.

38. The antibody of claim 36 or claim 37, wherein the antibody is a monoclonal antibody.

39. A multimeric MHC-peptide complex comprising the epitope of claim 1.

40. An isolated T cell expressing a T cell receptor specific for an MHC-peptide complex, the complex comprising the epitope of claim 1.

41. The T cell of claim 40, produced by an in vitro immunization.

42. The T cell of claim 40, isolated from an immunized animal.

43. A T cell clone comprising the T cell of claim 40.

44. A polyclonal population of T cells comprising the T cell of claim 40.

45. A pharmaceutical composition comprising the T cell of claim 40 and a pharmaceutically acceptable adjuvant, carrier, diluent, or excipient.






46. An isolated protein molecule comprising the binding domain of a T cell receptor specific for an MHC-peptide complex, the complex comprising the epitope of claim 1.

47. The protein of claim 46, wherein the protein is multivalent.

48. An isolated nucleic acid encoding the protein of claim 46.

49. A recombinant construct comprising the nucleic acid of claim 48.

50. A host cell expressing the recombinant construct of claim 33 or 49.

51. The host cell of claim 50, wherein the host cell is a dendritic cell, macrophage, tumor cell, or tumor-derived cell.

52. The host cell of claim 50, wherein the host cell is a bacterium, fungus, or protozoan.

53. A pharmaceutical composition comprising the host cell of claim 50 and a pharmaceutically acceptable adjuvant, carrier, diluent, or excipient.

54. A vaccine or immunotherapeutic composition comprising at least one component selected from the group consisting of the epitope of claim 1; the composition of claim 18, 32, 45, or 53; the construct of claim 33; the T cell of claim 40, and the host cell of claim 50.

55. A method of treating an animal, comprising:

administering to an animal the vaccine or immunotherapeutic composition of claim 54.

56. The method of claim 55, wherein the administering step comprises a mode of delivery selected from the group consisting of transdermal, intranodal, perinodal, oral, intravenous, intradermal, intramuscular, intraperitoneal, mucosal, aerosol inhalation, and instillation.

57. The method of claim 55, further comprising a step of assaying to determine a characteristic indicative of a state of a target cell or target cells.

58. The method of claim 57, comprising a first assaying step and a second assaying step, wherein the first assaying step precedes the administering step, and wherein the second assaying step follows the administering step.

59. The method of claim 58, further comprising a step of comparing the characteristic determined in the first assaying step with the characteristic determined in the second assaying step to obtain a result.

60. The method of claim 59, wherein the result is selected from the group consisting of: evidence of an immune response, a diminution in number of target cells, a loss of mass or size of a tumor comprising target cells, a decrease in number or concentration of an intracellular parasite infecting target cells.

61. A method of evaluating immunogenicity of a vaccine or immunotherapeutic composition, comprising:

administering to an animal the vaccine or immunotherapeutic composition of claim 54; and

evaluating immunogenicity based on a characteristic of the animal.

62. The method of claim 61, wherein the animal is HLA-transgenic.

63. A method of evaluating immunogenicity, comprising:

in vitro stimulation of a T cell with the vaccine or immunotherapeutic composition of claim 54; and

evaluating immunogenicity based on a characteristic of the T cell.

64. The method of claim 63, wherein the stimulation is a primary stimulation.

65. A method of making a passive/adoptive immunotherapeutic, comprising:

combining the T cell of claim 40 or the host cell of claim 50 with a pharmaceutically acceptable adjuvant, carrier, diluent, or excipient.

66. A method of determining specific T cell frequency comprising the step of contacting T cells with a MHC-peptide complex comprising the epitope of claim 1.

67. The method of claim 66, wherein the contacting step comprises at least one feature selected from the group consisting of immunization, restimulation, detection, and enumeration.

68. The method of claim 66, further comprising ELISPOT analysis, limiting dilution analysis, flow cytometry, in situ hybridization, the polymerase chain reaction or any combination thereof.

69. A method of evaluating immunologic response, comprising the method of claim 66 carried out prior to and subsequent to an immunization step.

70. A method of evaluating immunologic response, comprising:

determining frequency, cytokine production, or cytolytic activity of T cells, prior to and subsequent to a step of stimulation with MHC-peptide complexes comprising the epitope of claim 1.

71. A method of diagnosing a disease comprising:

contacting a subject tissue with at least one component selected from the group consisting of the T cell of claim 40, the host cell of claim 50, the antibody of claim 36, the protein of claim 46; and

diagnosing the disease based on a characteristic of the tissue or of the component.

72. The method of claim 71, wherein the contacting step takes place in vivo.

73. The method of claim 71, wherein the contacting step takes place in vitro.

74. A method of making a vaccine, comprising:

combining at least one component selected from the group consisting of the epitope of claim 1; the composition of claim 18, 32, 45, or 53; the construct of claim 33; the T cell of claim 40, and the host cell of claim 50, with a pharmaceutically acceptable adjuvant, carrier, diluent, or excipient.

75. A computer readable medium having recorded thereon the sequence of any one of 
SEQ ID NOS: X - Y, in a machine having a hardware or software that calculates the physical, 

5 biochemical, immunologic, or molecular genetic properties of a molecule embodying said 
sequence. 

76. A method of treating an animal comprising combining the method of claim 55 
combined wife at least one mode of treatment selected from the group of radiation therapy, 
chemotherapy, biochemotherapy, and surgery. 

10 77. An isolated polypeptide comprising an epitope cluster from a target-associated 

antigen having the sequence as disclosed in Tables 25-44, wherein the amino acid sequence 
consists of not more than about 80% of the amino acid sequence of the antigen. 

78 . A vaccine or immunotherapeutic product comprising the polypeptide of claim 78 . 

79. An isolated polynucleotide encoding the polypeptide of claim 78 . 

15 80. A vaccine or immunotherapeutic product comprising the polynucleotide of claim 



80. 



81. The polynucleotide of claim 79 or 80, wherein the polynucleotide is DNA. 

82. The polynucleotide of claim 79 or 80, wherein the polynucleotide is RNA. 
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CTAGJiUMAN NY-ESO ID 
AAD05202 - CAG-3 (1) 
CAA11044 -LAGE-la (1) 
CAA10194 - LAGE-ls (1) 
CAAU043 - LAGE-lb (1) 
CAA10196 - LAGE-1L 
:AAH02833 CT-2 
Consensus 

CTAG HUMAN NY-ESO (51) 

AAD05202 - CAG-3 (51) 

CAA11044 -LAGE-la (51) 

CAA10194 - LAGE-ls (51) 

CAA11043 - LAGE-lb (51) 

CAA10196 - LAGE-1L (51) 
AAH02833 CT-2 
Consensus 



50 
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AAD05202 - CAG-3 
CAA1104 4 -LAGE-la 
CAA10194 - LAGE-ls 
CAA11043 - LAGE-lb 
CAA10196 - LAGE-1L 
AAH02833 CT-2 
Consensus 
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^n n T,^^^nADG PGGPGI PDGPGGNAGGP G EAGATGGRGPRGAGA 

vinAKC^GTGGS TGDADGPGGPGI PPGPGGNAGGP G ERGATGGRGPRGAG R 
... , _ ~ - GPGGNAGGPGE 



(1) 

{ J} ^^^^^^^^^^D^^^^^GATGGRGER^ 
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SAGA 
100 
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Autologous DC Present Al Peptide to 

CD 8 T cell 




HLA Al CD8 T Cell Density (xlOOO/well) 



FIG 11 
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Secretion of IFNgama Was Blocked 
by Anti-Al Antibody 




Human CD8T Cell Density (x 1000/well) 



FIG 12 
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1 3 927 1 3 927 1 3 927 1 3 927 

Dilution of culture 



Graphs show lysis of unpulsed EL4 cells (open circles) and EL4 cells pulsed wi\h gp33 peptide 
(solid triangles). Symbols represent individual mice and one of three similar experiments is 
shown. 



FIG 15 
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Days after tumor challenge 



Mean tumor volumes ±1SD are shown for mice immunized with pEFGPL33A DMA (solid circles) 
or control pEGFP-N3 DtiA (open triangles). Numbers in brackets indicate number of mice with 
tumors / total number of mice in group.. One of two similar experiments is shown. 



FIG 16 
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EPITOPE SEQUENCES 

Field of the Invention 

The present invention generally relates to peptides, and nucleic acids encoding peptides, 
that are useful epitopes of target-associated antigens. More specifically, the invention relates to 
5 epitopes that have a high affinity for MHC class I and that are produced by target-specific 

proteasomes. 

Description nf the Related Art 

Neo plasia and th e Tmmune System 

The neoplastic disease state commonly known as cancer is thought to result generally from 
1 0 a single cell growing out of control. The uncontrolled growth state typically results from a multi- 

step process in which a series of cellular systems fail, resulting in the genesis of a neoplastic cell. 
The resulting neoplastic cell rapidly reproduces itself, forms one or more tumors, and eventually 

may cause the death of the host. 

Because the progenitor of the neoplastic cell shares the host's genetic material, neoplastic 
1 5 cells are largely unassailed by the host's immune system During immune surveillance, the process 

in which the host's immune system surveys and localizes foreign materials, a neoplastic cell will 
appear to the host's immune surveillance machinery as a "self' cell. 
Viruses and the Immune System 

In contrast to cancer cells, virus infection involves the expression of clearly non-self 

20 antigens. As a result, many virus infections are successfully dealt with by the immune system with 

minimal clinical sequela. Moreover, it has been possible to develop effective vaccines for many of 
those infections that do cause serious disease. A variety of vaccine approaches have been used 
successfully to combat various diseases. These approaches include subunit vaccines consisting of 
individual proteins produced through recombinant DNA technology. Notwithstanding these 

25 advances, the selection and effective administration of minimal epitopes for use as viral vaccines 

has remained problematic. 

In addition to the difficulties involved in epitope selection, there is the problem of viruses
that have evolved the capability of evading a host's immune system. Many viruses, especially
viruses that establish persistent infections, such as members of the herpes and pox virus families,
produce immunomodulatory molecules that permit the virus to evade the host's immune system.
The effects of these immunomodulatory molecules on antigen presentation may be overcome by
the targeting of select epitopes for administration as immunogenic compositions. To better
understand the interaction of neoplastic cells and virally infected cells with the host's immune
system, a discussion of the system's components follows below.

The immune system functions to discriminate molecules endogenous to an organism
("self" molecules) from material exogenous or foreign to the organism ("non-self" molecules). The
immune system has two types of adaptive responses to foreign bodies based on the components that
mediate the response: a humoral response and a cell-mediated response. The humoral response is
mediated by antibodies, while the cell-mediated response involves cells classified as lymphocytes.
Recent anticancer and antiviral strategies have focused on mobilizing the host immune system as a
means of anticancer or antiviral treatment or therapy.

The immune system functions in three phases to protect the host from foreign bodies: the
cognitive phase, the activation phase, and the effector phase. In the cognitive phase, the immune
system recognizes and signals the presence of a foreign antigen or invader in the body. The foreign
antigen can be, for example, a cell surface marker from a neoplastic cell or a viral protein. Once the
system is aware of an invading body, antigen specific cells of the immune system proliferate and
differentiate in response to the invader-triggered signals. The last stage is the effector stage in which
the effector cells of the immune system respond to and neutralize the detected invader.

An array of effector cells implements an immune response to an invader. One type of
effector cell, the B cell, generates antibodies targeted against foreign antigens encountered by the
host. In combination with the complement system, antibodies direct the destruction of cells or
organisms bearing the targeted antigen. Another type of effector cell is the natural killer cell (NK
cell), a type of lymphocyte having the capacity to spontaneously recognize and destroy a variety of
virus-infected cells as well as malignant cell types. The method used by NK cells to recognize target
cells is poorly understood.

Another type of effector cell, the T cell, has members classified into three subcategories, each
playing a different role in the immune response. Helper T cells secrete cytokines which stimulate the
proliferation of other cells necessary for mounting an effective immune response, while suppressor T
cells down-regulate the immune response. A third category of T cell, the cytotoxic T cell (CTL), is
capable of directly lysing a targeted cell presenting a foreign antigen on its surface.

The Major Histocompatibility Complex and T Cell Target Recognition

T cells are antigen-specific immune cells that function in response to specific antigen signals.
B lymphocytes and the antibodies they produce are also antigen-specific entities. However, unlike B
lymphocytes, T cells do not respond to antigens in a free or soluble form. For a T cell to respond to
an antigen, it requires the antigen to be processed to peptides which are then bound to a presenting
structure encoded in the major histocompatibility complex (MHC). This requirement is called "MHC
restriction" and it is the mechanism by which T cells differentiate "self" from "non-self" cells. If an
antigen is not displayed by a recognizable MHC molecule, the T cell will not recognize and act on the
antigen signal. T cells specific for a peptide bound to a recognizable MHC molecule bind to these
MHC-peptide complexes and proceed to the next stages of the immune response.

There are two types of MHC, class I MHC and class II MHC. T helper cells (CD4+)
predominately interact with class II MHC proteins while cytolytic T cells (CD8+) predominately
interact with class I MHC proteins. Both classes of MHC protein are transmembrane proteins with a
majority of their structure on the external surface of the cell. Additionally, both classes of MHC
proteins have a peptide binding cleft on their external portions. It is in this cleft that small fragments
of proteins, endogenous or foreign, are bound and presented to the extracellular environment.

Cells called "professional antigen presenting cells" (pAPCs) display antigens to T cells using
the MHC proteins but additionally express various co-stimulatory molecules depending on the
particular state of differentiation/activation of the pAPC. When T cells, specific for the peptide bound
to a recognizable MHC protein, bind to these MHC-peptide complexes on pAPCs, the specific
co-stimulatory molecules that act upon the T cell direct the path of differentiation/activation taken by the
T cell. That is, the co-stimulatory molecules affect how the T cell will act on antigenic signals in
future encounters as it proceeds to the next stages of the immune response.

As discussed above, neoplastic cells are largely ignored by the immune system. A great deal
of effort is now being expended in an attempt to harness a host's immune system to aid in combating
the presence of neoplastic cells in a host. One such area of research involves the formulation of
anticancer vaccines.

Anticancer Vaccines 

Among the various weapons available to an oncologist in the battle against cancer is the
immune system of the patient. Work has been done in various attempts to cause the immune
system to combat cancer or neoplastic diseases. Unfortunately, the results to date have been
largely disappointing. One area of particular interest involves the generation and use of anticancer
vaccines.

To generate a vaccine or other immunogenic composition, it is necessary to introduce to a
subject an antigen or epitope against which an immune response may be mounted. Although
neoplastic cells are derived from and therefore are substantially identical to normal cells on a
genetic level, many neoplastic cells are known to present tumor-associated antigens (TuAAs). In
theory, these antigens could be used by a subject's immune system to recognize and
attack the neoplastic cells. In reality, however, neoplastic cells generally appear to be ignored by
the host's immune system.

A number of different strategies have been developed in an attempt to generate vaccines
with activity against neoplastic cells. These strategies include the use of tumor-associated antigens
as immunogens. For example, U.S. Patent No. 5,993,828 describes a method for producing an
immune response against a particular subunit of the Urinary Tumor Associated Antigen by
administering to a subject an effective dose of a composition comprising inactivated tumor cells
having the Urinary Tumor Associated Antigen on the cell surface and at least one tumor associated
antigen selected from the group consisting of GM-2, GD-2, Fetal Antigen and Melanoma
Associated Antigen. Accordingly, this patent describes using whole, inactivated tumor cells as the
immunogen in an anticancer vaccine.

Another strategy used with anticancer vaccines involves administering a composition
containing isolated tumor antigens. In one approach, MAGE-A1 antigenic peptides were used as
an immunogen. (See Chaux, P., et al., "Identification of Five MAGE-A1 Epitopes Recognized by
Cytolytic T Lymphocytes Obtained by In Vitro Stimulation with Dendritic Cells Transduced with
MAGE-A1," J. Immunol., 163(5):2928-2936 (1999)). There have been several therapeutic trials
using MAGE-A1 peptides for vaccination, although the effectiveness of the vaccination regimes
was limited. The results of some of these trials are discussed in Vose, J.M., "Tumor Antigens
Recognized by T Lymphocytes," 10th European Cancer Conference, Day 2, Sept. 14, 1999.

In another example of tumor associated antigens used as vaccines, Scheinberg, et al.
treated 12 chronic myelogenous leukemia (CML) patients already receiving interferon (IFN) or
hydroxyurea with 5 injections of class I-associated bcr-abl peptides with a helper peptide plus the
adjuvant QS-21. Scheinberg, DA, et al., "BCR-ABL Breakpoint Derived Oncogene Fusion
Peptide Vaccines Generate Specific Immune Responses in Patients with Chronic Myelogenous
Leukemia (CML)," [Abstract 1665], American Society of Clinical Oncology 35th Annual Meeting,
Atlanta (1999). Proliferative and delayed type hypersensitivity (DTH) T cell responses indicative
of T-helper activity were elicited, but no cytolytic killer T cell activity was observed within the
fresh blood samples.

Additional examples of attempts to identify TuAAs for use as vaccines are seen in the
recent work of Cebon, et al. and Scheibenbogen, et al. Cebon, et al. immunized patients with
metastatic melanoma using intradermally administered MART-1 26-35 peptide with IL-12 in
increasing doses given either subcutaneously or intravenously. Of the first 15 patients, 1 complete
remission, 1 partial remission, and 1 mixed response were noted. Immune assays for T cell
generation included DTH, which was seen in patients with or without IL-12. Positive CTL assays
were seen in patients with evidence of clinical benefit, but not in patients without tumor regression.
Cebon, et al., "Phase I Studies of Immunization with Melan-A and IL-12 in HLA A2+ Positive
Patients with Stage III and IV Malignant Melanoma," [Abstract 1671], American Society of
Clinical Oncology 35th Annual Meeting, Atlanta (1999).

Scheibenbogen, et al. immunized 18 patients with 4 HLA class I restricted tyrosinase
peptides, 16 with metastatic melanoma and 2 adjuvant patients. Scheibenbogen, et al.,
"Vaccination with Tyrosinase peptides and GM-CSF in Metastatic Melanoma: a Phase II Trial,"
[Abstract 1680], American Society of Clinical Oncology 35th Annual Meeting, Atlanta (1999).
Increased CTL activity was observed in 4/15 patients, 2 adjuvant patients, and 2 patients with
evidence of tumor regression. As in the trial by Cebon, et al., patients with progressive disease did
not show boosted immunity. In spite of the various efforts expended to date to generate efficacious
anticancer vaccines, no such composition has yet been developed.

Antiviral Vaccines

Vaccine strategies to protect against viral diseases have had many successes. Perhaps the
most notable of these is the progress that has been made against the disease smallpox, which has
been driven to extinction. The success of the polio vaccine is of a similar magnitude.

Viral vaccines can be grouped into three classifications: live attenuated virus vaccines,
such as vaccinia for smallpox, the Sabin poliovirus vaccine, and measles, mumps, and rubella;
whole killed or inactivated virus vaccines, such as the Salk poliovirus vaccine, hepatitis A virus
vaccine and the typical influenza virus vaccines; and subunit vaccines, such as hepatitis B. Due to
their lack of a complete viral genome, subunit vaccines offer a greater degree of safety than those
based on whole viruses.

The paradigm of a successful subunit vaccine is the recombinant hepatitis B vaccine based 
on the virus's envelope protein. Despite much academic interest in pushing the reductionist
subunit concept beyond single proteins to individual epitopes, the efforts have yet to bear much 
fruit. Viral vaccine research has also concentrated on the induction of an antibody response 
although cellular responses also occur. However, many of the subunit formulations are particularly 
poor at generating a CTL response. 

Summary of the Invention 

Previous methods of priming professional antigen presenting cells (pAPCs) to display 
target cell epitopes have relied simply on causing the pAPCs to express target-associated antigens 
(TAAs), or epitopes of those antigens which are thought to have a high affinity for MHC I 
molecules. However, the proteasomal processing of such antigens results in presentation of 
epitopes on the pAPC that do not correspond to the epitopes present on the target cells. 

Using the knowledge that an effective cellular immune response requires that pAPCs 
present the same epitope that is presented by the target cells, the present invention provides 
epitopes that have a high affinity for MHC I, and that correspond to the processing specificity of 
the housekeeping proteasome, which is active in peripheral cells. These epitopes thus correspond 
to those presented on target cells. The use of such epitopes in vaccines can activate the cellular 
immune response to recognize the correctly processed TAA and can result in removal of target 
cells that present such epitopes. In some embodiments, the housekeeping epitopes provided herein 
can be used in combination with immune epitopes, generating a cellular immune response that is 
competent to attack target cells both before and after interferon induction. In other embodiments
the epitopes are useful in the diagnosis and monitoring of the target-associated disease and in the
generation of immunological reagents for such purposes.

Embodiments of the invention relate to isolated epitopes and antigens or polypeptides that
comprise the epitopes. Preferred embodiments include an epitope or antigen having the sequence
as disclosed in Table 1. Other embodiments can include an epitope cluster comprising a
polypeptide from Table 1. Further, embodiments include a polypeptide having substantial similarity
to the already mentioned epitopes, antigens, or clusters. Other preferred embodiments include a
polypeptide having functional similarity to any of the above. Still further embodiments relate to a
nucleic acid encoding the polypeptide of any of the epitopes, clusters, antigens, and polypeptides
from Table 1 and mentioned herein.

The epitope can be immunologically active. The polypeptide comprising the epitope can
be less than about 30 amino acids in length; more preferably, the polypeptide is 8 to 10 amino acids
in length, for example. Substantial or functional similarity can include addition of at least one
amino acid, for example, and the at least one additional amino acid can be at an N-terminus of the
polypeptide. The substantial or functional similarity can include a substitution of at least one
amino acid.

The epitope, cluster, or polypeptide comprising the same can have affinity to an HLA-A2
molecule. The affinity can be determined by an assay of binding, by an assay of restriction of
epitope recognition, by a prediction algorithm, and the like. The epitope, cluster, or polypeptide
comprising the same can have affinity to an HLA-B7 molecule, an HLA-B51 molecule, and the like.
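
As a minimal illustration of what a prediction algorithm of this kind might look like, the following Python sketch enumerates the 8- to 10-residue windows of a longer polypeptide and flags those that match a simplified HLA-A2 anchor pattern. The anchor rule (leucine or methionine at position 2; valine, leucine, or isoleucine at the C-terminus), the function names, and the choice of input are assumptions made for illustration only. They are not taken from this description, and a motif match is no substitute for a binding assay.

```python
# Hypothetical sketch: enumerate candidate peptides and apply a simplified
# HLA-A2 anchor-residue screen. The anchor sets below are an assumption
# made for illustration, not a rule stated in this description.
P2_ANCHORS = {"L", "M"}            # assumed preferred residues at position 2
C_TERM_ANCHORS = {"V", "L", "I"}   # assumed preferred C-terminal residues


def candidate_windows(sequence, min_len=8, max_len=10):
    """Yield (start, end, peptide) using 1-based, inclusive coordinates
    relative to the input sequence."""
    n = len(sequence)
    for length in range(min_len, max_len + 1):
        for i in range(n - length + 1):
            yield i + 1, i + length, sequence[i:i + length]


def matches_anchor_motif(peptide):
    """Return True if the peptide satisfies the simplified anchor pattern."""
    return peptide[1] in P2_ANCHORS and peptide[-1] in C_TERM_ANCHORS


def screen(sequence):
    """Return every 8- to 10-mer window that matches the anchor pattern."""
    return [(s, e, p) for s, e, p in candidate_windows(sequence)
            if matches_anchor_motif(p)]


if __name__ == "__main__":
    # FLPWHRLFLL is the sequence listed as Tyr 207-216 in Table 1A.
    for start, end, peptide in screen("FLPWHRLFLL"):
        print(start, end, peptide)
```

The coordinates reported are relative to the fragment supplied, so they would need to be offset by the fragment's position in the parent antigen before being compared with residue numbering such as 207-216.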

In preferred embodiments the polypeptide can be a housekeeping epitope. The epitope or
polypeptide can correspond to an epitope displayed on a tumor cell, to an epitope displayed on a
neovasculature cell, and the like. The epitope or polypeptide can be an immune epitope. The
epitope, cluster and/or polypeptide can be a nucleic acid.

Other embodiments relate to pharmaceutical compositions comprising the polypeptides,
including an epitope from Table 1, a cluster, or a polypeptide comprising the same and a
pharmaceutically acceptable adjuvant, carrier, diluent, excipient, and the like. The adjuvant can be
a polynucleotide. The polynucleotide can include a dinucleotide. The dinucleotide can be CpG,
for example. The adjuvant can be encoded by a polynucleotide. The adjuvant can be a cytokine
and the cytokine can be, for example, GM-CSF.

The pharmaceutical compositions can further include a professional antigen-presenting cell
(pAPC). The pAPC can be a dendritic cell, for example. The pharmaceutical composition can
further include a second epitope. The second epitope can be a polypeptide. The second epitope
can be a nucleic acid. The second epitope can be a housekeeping epitope, an immune epitope, and
the like.

Still further embodiments relate to pharmaceutical compositions that include any of the
nucleic acids discussed herein, including those that encode polypeptides that comprise epitopes or
antigens from Table 1. Such compositions can include a pharmaceutically acceptable adjuvant,
carrier, diluent, excipient, and the like.

Other embodiments relate to recombinant constructs that include such a nucleic acid as
described herein, including those that encode polypeptides that comprise epitopes or antigens from
Table 1. The constructs can further include a plasmid, a viral vector, an artificial chromosome, and
the like. The construct can further include a sequence encoding at least one feature, such as for
example, a second epitope, an IRES, an ISS, an MS, ubiquitin.

Further embodiments relate to purified antibodies that specifically bind to at least one of
the epitopes in Table 1A. Other embodiments relate to purified antibodies that specifically bind to
a peptide-MHC protein complex comprising an epitope disclosed in Table 1A or any other suitable
epitope. The antibody from any embodiment can be a monoclonal antibody.

Still other embodiments relate to multimeric MHC-peptide complexes that include an 
epitope, such as, for example, an epitope disclosed in Table 1. 

Embodiments relate to isolated T cells expressing a T cell receptor specific for an
MHC-peptide complex. The complex can include an epitope, such as, for example, an epitope disclosed
in Table 1 of claim 1. The T cell can be produced by an in vitro immunization. The T cell can be
isolated from an immunized animal. Embodiments relate to T cell clones, including cloned T cells,
such as those discussed above. Embodiments also relate to polyclonal populations of T cells. Such
populations can include a T cell, as described above, for example.

Still further embodiments relate to pharmaceutical compositions that include a T cell, such
as those described above, for example, and a pharmaceutically acceptable adjuvant, carrier, diluent,
excipient, and the like.

Embodiments of the invention relate to isolated protein molecules comprising the binding
domain of a T cell receptor specific for an MHC-peptide complex. The complex can include an
epitope disclosed in Table 1. The protein can be multivalent. Other embodiments relate to isolated
nucleic acids encoding such proteins. Still further embodiments relate to recombinant constructs
that include such nucleic acids.

Other embodiments of the invention relate to host cells expressing the recombinant
construct described herein, including constructs encoding an epitope, cluster or polypeptide
comprising the same, disclosed in Table 1, for example. The host cell can be a dendritic cell,
macrophage, tumor cell, tumor-derived cell, and the like. The host cell can be a bacterium, fungus,
protozoan and the like. Embodiments also relate to pharmaceutical compositions that include a
host cell, such as those discussed herein, and a pharmaceutically acceptable adjuvant, carrier,
diluent, excipient, and the like.

Still other embodiments relate to vaccines or immunotherapeutic compositions that include
at least one component, such as, for example, an epitope disclosed in Table 1 or otherwise
described herein; a cluster that includes such an epitope; an antigen or polypeptide that includes
such an epitope; a composition described above and herein; a construct, a T cell, or a host cell as
described above and herein.

Further embodiments relate to methods of treating an animal. The methods can include
administering to an animal a vaccine or immunotherapeutic composition, including those disclosed
above and herein. The administering step can include a mode of delivery, such as, for example,
transdermal, intranodal, perinodal, oral, intravenous, intradermal, intramuscular, intraperitoneal,
mucosal, aerosol inhalation, instillation, and the like. The method can further include a step of
assaying to determine a characteristic indicative of a state of a target cell or target cells. The
method can include a first assaying step and a second assaying step, wherein the first assaying step
precedes the administering step, and wherein the second assaying step follows the administering
step. The method can further include a step of comparing the characteristic determined in the first
assaying step with the characteristic determined in the second assaying step to obtain a result. The
result can be, for example, evidence of an immune response, a diminution in number of target cells,
a loss of mass or size of a tumor comprising target cells, a decrease in number or concentration of
an intracellular parasite infecting target cells, and the like.

Embodiments relate to methods of evaluating immunogenicity of a vaccine or
immunotherapeutic composition. The methods can include administering to an animal a vaccine or
immunotherapeutic, such as those described above and elsewhere herein, and evaluating
immunogenicity based on a characteristic of the animal. The animal can be HLA-transgenic.

Other embodiments relate to methods of evaluating immunogenicity that include in vitro 
stimulation of a T cell with the vaccine or immunotherapeutic composition, such as those described 
above and elsewhere herein, and evaluating immunogenicity based on a characteristic of the T cell. 
The stimulation can be a primary stimulation. 

Still further embodiments relate to methods of making a passive/adoptive
immunotherapeutic. The methods can include combining a T cell or a host cell, such as those
described above and elsewhere herein, with a pharmaceutically acceptable adjuvant, carrier,
diluent, excipient, and the like.

Other embodiments relate to methods of determining specific T cell frequency, and can
include the step of contacting T cells with an MHC-peptide complex comprising an epitope
disclosed in Table 1, or a complex comprising a cluster or antigen comprising such an epitope. The
contacting step can include at least one feature, such as, for example, immunization, restimulation,
detection, enumeration, and the like. The method can further include ELISPOT analysis, limiting
dilution analysis, flow cytometry, in situ hybridization, the polymerase chain reaction, any
combination thereof, and the like.

Embodiments relate to methods of evaluating immunologic response. The methods can
include the above-described methods of determining specific T cell frequency carried out prior to and
subsequent to an immunization step.
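
The arithmetic behind such a before-and-after comparison can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the normalization to one million cells, and the example counts are hypothetical and are not prescribed by this description.

```python
# Hypothetical sketch: compare epitope-specific T cell frequency measured
# before and after an immunization step. Names and units are assumptions.

def frequency_per_million(positive_cells, total_cells):
    """Express a count of responding cells per one million cells assayed."""
    if total_cells <= 0:
        raise ValueError("total_cells must be positive")
    return 1_000_000 * positive_cells / total_cells


def fold_change(pre_frequency, post_frequency):
    """Return post/pre, or None when the pre-immunization value is zero
    and a ratio is therefore undefined."""
    if pre_frequency == 0:
        return None
    return post_frequency / pre_frequency


if __name__ == "__main__":
    # Invented example counts, e.g. spots counted in an ELISPOT well.
    pre = frequency_per_million(positive_cells=5, total_cells=200_000)
    post = frequency_per_million(positive_cells=50, total_cells=200_000)
    print(pre, post, fold_change(pre, post))
```

Whether a given fold change counts as evidence of a response depends on assay background and replicate variability, which this sketch does not model.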

Another embodiment relates to methods of evaluating immunologic response. The
methods can include determining frequency, cytokine production, or cytolytic activity of T cells,
prior to and subsequent to a step of stimulation with MHC-peptide complexes comprising an
epitope, such as, for example an epitope from Table 1, a cluster or a polypeptide comprising such
an epitope.

Further embodiments relate to methods of diagnosing a disease. The methods can include
contacting a subject tissue with at least one component, including, for example, a T cell, a host cell,
an antibody, a protein, including those described above and elsewhere herein; and diagnosing the
disease based on a characteristic of the tissue or of the component. The contacting step can take
place in vivo. The contacting step can take place in vitro.

Still other embodiments relate to methods of making a vaccine. The methods can include
combining at least one component, such as an epitope, a composition, a construct, a T cell, or a host cell,
including any of those described above and elsewhere herein, with a pharmaceutically acceptable
adjuvant, carrier, diluent, excipient, and the like.

Embodiments relate to computer readable media having recorded thereon the sequence of
any one of SEQ ID NOS: 1-602, in a machine having hardware or software that calculates the
physical, biochemical, immunologic, or molecular genetic properties of a molecule embodying said
sequence.
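
By way of a hedged example of software that calculates a physical property from a recorded sequence, the Python sketch below computes amino acid composition and an approximate average molecular weight for a peptide. The function names are hypothetical, and the residue masses are standard rounded textbook averages rather than values given in this description.

```python
# Hypothetical sketch: simple physical properties computed from a peptide
# sequence. Residue masses are standard average values in daltons (rounded);
# they are an assumption of this example, not part of the description.
from collections import Counter

RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water is added for the free N- and C-termini


def composition(peptide):
    """Count how many times each residue occurs in the peptide."""
    return Counter(peptide.upper())


def molecular_weight(peptide):
    """Approximate average molecular weight of a linear, unmodified peptide."""
    try:
        residues = sum(RESIDUE_MASS[aa] for aa in peptide.upper())
    except KeyError as err:
        raise ValueError(f"unknown residue: {err.args[0]}") from None
    return residues + WATER_MASS


if __name__ == "__main__":
    peptide = "FLPWHRLFLL"  # listed as Tyr 207-216 in Table 1A
    print(dict(composition(peptide)))
    print(round(molecular_weight(peptide), 2))
```

The same pattern extends to other sequence-derived properties, such as net charge or hydrophobicity, by swapping in a different per-residue table.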

Still other embodiments relate to methods of treating an animal. The methods can include
combining the method of treating an animal, which includes administering to the animal a vaccine or
immunotherapeutic composition, such as described above and elsewhere herein, with at
least one other mode of treatment, including, for example, radiation therapy, chemotherapy,
biochemotherapy, surgery, and the like.

Further embodiments relate to isolated polypeptides that include an epitope cluster from a 
target-associated antigen having the sequence as disclosed in any one of Tables 25-44, wherein the 
amino acid sequence includes not more than about 80% of the amino acid sequence of the antigen. 
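
The length limitation in the preceding paragraph reduces to a simple ratio. The Python sketch below, offered only as an illustration, checks what fraction of a parent antigen a candidate cluster polypeptide covers. The function names, the requirement that the cluster be a contiguous substring of the antigen, the treatment of "about 80%" as a hard 0.80 cutoff, and the toy sequences are all assumptions of this example.

```python
# Hypothetical sketch: check that a cluster polypeptide covers no more than a
# given fraction of its parent antigen. The 0.80 cutoff is a literal reading
# of "about 80%" and is an assumption of this example.

def fraction_of_antigen(cluster, antigen):
    """Return len(cluster) / len(antigen) for a cluster found in the antigen."""
    if not antigen:
        raise ValueError("antigen sequence must not be empty")
    if cluster not in antigen:
        raise ValueError("cluster is not a contiguous part of the antigen")
    return len(cluster) / len(antigen)


def within_limit(cluster, antigen, limit=0.80):
    """Return True if the cluster covers no more than `limit` of the antigen."""
    return fraction_of_antigen(cluster, antigen) <= limit


if __name__ == "__main__":
    antigen = "ACDEFGHIKL"   # toy sequence, not a real antigen
    print(fraction_of_antigen("DEFG", antigen), within_limit("DEFG", antigen))
```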

Other embodiments relate to vaccines or immunotherapeutic products that include an
isolated peptide as described above and elsewhere herein. Still other embodiments relate to
isolated polynucleotides encoding a polypeptide as described above and elsewhere herein. Other
embodiments relate to vaccines or immunotherapeutic products that include these polynucleotides.
The polynucleotide can be DNA or RNA.

Brief Description of the Drawings
Figure 1 is a sequence alignment of NY-ESO-1 and several similar protein sequences. 
Figure 2 graphically represents a plasmid vaccine backbone useful for delivering nucleic 
acid-encoded epitopes. 

Figures 3A and 3B are FACS profiles showing results of HLA-A2 binding assays for
tyrosinase 207-215 and tyrosinase 208-216.

Figure 4 is a T=120 min. time point mass spectrum of the fragments produced by
proteasomal cleavage of SSX-2 31-68.

Figure 5 shows a binding curve for HLA-A2:SSX-2 41-49 with controls.

Figure 6 shows specific lysis of SSX-2 41-49-pulsed targets by CTL from SSX-2 41-49-immunized
HLA-A2 transgenic mice.

Figures 7A, B, and C show results of N-terminal pool sequencing of a T=60 min. time point
aliquot of the PSMA163-192 proteasomal digest.

Figure 8 shows binding curves for HLA-A2:PSMA168-177 and HLA-A2:PSMA288-297 with
controls.

Figure 9 shows results of N-terminal pool sequencing of a T=60 min. time point aliquot of
the PSMA281-310 proteasomal digest.

Figure 10 shows binding curves for HLA-A2:PSMA461-469, HLA-A2:PSMA460-469, and
HLA-A2:PSMA663-671, with controls.

Figure 11 shows the results of a γ-IFN-based ELISPOT assay detecting PSMA^ti-reactive
HLA-A1+ CD8+ T cells.

Figure 12 shows blocking of reactivity of the T cells used in figure 10 by anti-HLA-A1
mAb, demonstrating HLA-A1-restricted recognition.

Figure 13 shows a binding curve for HLA-A2:PSMA663-671, with controls.

Figure 14 shows a binding curve for HLA-A2:PSMA662-671, with controls.

Figure 15. Comparison of anti-peptide CTL responses following immunization with
various doses of DNA by different routes of injection.

Figure 16. Growth of transplanted gp33 expressing tumor in mice immunized by i.ln.
injection of gp33 epitope-expressing, or control, plasmid.

Figure 17. Amount of plasmid DNA detected by real-time PCR in injected or draining
lymph nodes at various times after i.ln. or i.m. injection, respectively.

Detailed Description of the Preferred Embodiment 

Definitions 

Unless otherwise clear from the context of the use of a term herein, the following listed
terms shall generally have the indicated meanings for purposes of this description.

PROFESSIONAL ANTIGEN-PRESENTING CELL (pAPC) - a cell that possesses T cell
costimulatory molecules and is able to induce a T cell response. Well characterized pAPCs include
dendritic cells, B cells, and macrophages.

PERIPHERAL CELL - a cell that is not a pAPC. 

HOUSEKEEPING PROTEASOME - a proteasome normally active in peripheral cells, and 
generally not present or not strongly active in pAPCs. 

IMMUNE PROTEASOME - a proteasome normally active in pAPCs; the immune 
proteasome is also active in some peripheral cells in infected tissues. 

EPITOPE - a molecule or substance capable of stimulating an immune response. In 
preferred embodiments, epitopes according to this definition include but are not necessarily limited 
to a polypeptide and a nucleic acid encoding a polypeptide, wherein the polypeptide is capable of 
stimulating an immune response. In other preferred embodiments, epitopes according to this 
definition include but are not necessarily limited to peptides presented on the surface of cells, the 
peptides being non-covalently bound to the binding cleft of class I MHC, such that they can 
interact with T cell receptors. 

MHC EPITOPE - a polypeptide having a known or predicted binding affinity for a 
mammalian class I or class II major histocompatibility complex (MHC) molecule.

HOUSEKEEPING EPITOPE - In a preferred embodiment, a housekeeping epitope is 
defined as a polypeptide fragment that is an MHC epitope, and that is displayed on a cell in which 
housekeeping proteasomes are predominantly active. In another preferred embodiment, a 
housekeeping epitope is defined as a polypeptide containing a housekeeping epitope according to 
the foregoing definition, that is flanked by one to several additional amino acids. In another 
preferred embodiment, a housekeeping epitope is defined as a nucleic acid that encodes a 
housekeeping epitope according to the foregoing definitions. 

IMMUNE EPITOPE - In a preferred embodiment, an immune epitope is defined as a 
polypeptide fragment that is an MHC epitope, and that is displayed on a cell in which immune 
proteasomes are predominantly active. In another preferred embodiment, an immune epitope is 
defined as a polypeptide containing an immune epitope according to the foregoing definition, that 
is flanked by one to several additional amino acids. In another preferred embodiment, an immune 
epitope is defined as a polypeptide including an epitope cluster sequence, having at least two 
polypeptide sequences having a known or predicted affinity for a class I MHC. In yet another 
preferred embodiment, an immune epitope is defined as a nucleic acid that encodes an immune 
epitope according to any of the foregoing definitions. 

TARGET CELL - a cell to be targeted by the vaccines and methods of the invention. 
Examples of target cells according to this definition include but are not necessarily limited to: a 
neoplastic cell and a cell harboring an intracellular parasite, such as, for example, a virus, a
bacterium, or a protozoan. 

TARGET-ASSOCIATED ANTIGEN (TAA) - a protein or polypeptide present in a target cell.

TUMOR-ASSOCIATED ANTIGEN (TuAA) - a TAA, wherein the target cell is a
neoplastic cell.

HLA EPITOPE - a polypeptide having a known or predicted binding affinity for a human 
class I or class II HLA complex molecule.

ANTIBODY - a natural immunoglobulin (Ig), poly- or monoclonal, or any molecule 
composed in whole or in part of an Ig binding domain, whether derived biochemically or by use of 
recombinant DNA. Examples include inter alia, F(ab), single chain Fv, and Ig variable region- 
phage coat protein fusions. 

ENCODE - an open-ended term such that a nucleic acid encoding a particular amino acid 
sequence can consist of codons specifying that (poly)peptide, but can also comprise additional 
sequences either translatable, or for the control of transcription, translation, or replication, or to 
facilitate manipulation of some host nucleic acid construct. 

SUBSTANTIAL SIMILARITY - this term is used to refer to sequences that differ from a 
reference sequence in an inconsequential way as judged by examination of the sequence. Nucleic 
acid sequences encoding the same amino acid sequence are substantially similar despite differences 
in degenerate positions or modest differences in length or composition of any non-coding regions. 
Amino acid sequences differing only by conservative substitution or minor length variations are 
substantially similar. Additionally, amino acid sequences comprising housekeeping epitopes that 
differ in the number of N-terminal flanking residues, or immune epitopes and epitope clusters that 
differ in the number of flanking residues at either terminus, are substantially similar. Nucleic acids 
that encode substantially similar amino acid sequences are themselves also substantially similar. 

FUNCTIONAL SIMILARITY - this term is used to refer to sequences that differ from a 
reference sequence in an inconsequential way as judged by examination of a biological or 
biochemical property, although the sequences may not be substantially similar. For example, two 
nucleic acids can be useful as hybridization probes for the same sequence but encode differing 
amino acid sequences. Two peptides that induce cross-reactive CTL responses are functionally 
similar even if they differ by non-conservative amino acid substitutions (and thus do not meet the 
substantial similarity definition). Pairs of antibodies, or TCRs, that recognize the same epitope can 
be functionally similar to each other despite whatever structural differences exist. In testing for 
functional similarity of immunogenicity one would generally immunize with the "altered" antigen 
and test the ability of the elicited response (Ab, CTL, cytokine production, etc.) to recognize the 
target antigen. Accordingly, two sequences may be designed to differ in certain respects while 
retaining the same function. Such designed sequence variants are among the embodiments of the 
present invention. 

Table 1A. SEQ ID NOS.* including epitopes in Examples 1-7, 13. 



SEQ ID NO  IDENTITY            SEQUENCE
1          Tyr 207-216         FLPWHRLFLL
2          Tyrosinase protein  Accession number**: P14679
3          SSX-2 protein       Accession number: NP_003138
4          PSMA protein        Accession number: NP_004467
5          Tyrosinase cDNA     Accession number: NM_000372
6          SSX-2 cDNA          Accession number: NM_003147
7          PSMA cDNA           Accession number: NM_004476
8          Tyr 207-215         FLPWHRLFL
9          Tyr 208-216         LPWHRLFLL
10         SSX-2 31-68         YFSKEEWEKMKASEKIFYVYMKRKYEAMTKLGFKATLP
11         SSX-2 32-40         FSKEEWEKM


12 




O/nCASFKTF 


13 


ccy.o 40-48 




14 


QGY.9 19-48 


iviyj rs> fij rvix j r x 


15 




iv/TjLjj_iivi_r x v 


16 




N/ntASFinFYV 


17 


<I^Y-9 41 -SO 


(S^r^oJufwxr xvi 


18 


49-49 


ASFKTFYVY 
rtoxvxvxr xvi 


19 




RKYFAMTKT 


20 


OOA Z 


IVIXJV I J_,/\1VI 1 IVL j 


21 


Ol)A - Z J*t"Uj 


R'VFAIWTK'T jOF 

IV X Xv/T-LVI X rV-LAJX 


22 




VFAMTKT HF 


23 


uOA ^ JU"UJ 


FAMTYT OF 


24 


pTRVl 8-9 7 

CUD V 1 O-Z. / 


rLriui/ X PXTO V 


25 


TTT A-R44 kmHpr 


AFMfrKYSFY 
rvCrivivJ JX. X Of X 


26 


uOA 1 *tl"*t7 


rv x oxjXvxlj x v 


27         SSX-3 41-49         KVSEKIVYV
28         SSX-4 41-49         KSSEKIYYV
29         SSX-5 41-49         KASEKIIYV
30         PSMA 163-192        AFSPQGMPEGDLVYVNYARTEDFFKLERDM
31         PSMA 168-190        GMPEGDLVYVNYARTEDFFKLER
32         PSMA 169-177        MPEGDLVYV
33         PSMA 168-177        GMPEGDLVYV
34         PSMA 168-176        GMPEGDLVY
35         PSMA 167-176        QGMPEGDLVY
36         PSMA 169-176        MPEGDLVY
37         PSMA 171-179        EGDLVYVNY
38         PSMA 170-179        PEGDLVYVNY
39         PSMA 174-183        LVYVNYARTE
40         PSMA 177-185        VNYARTEDF
41         PSMA 176-185        YVNYARTEDF
42         PSMA 178-186        NYARTEDFF


43 


PSMA 17Q-1R£ 




44 


P9MA 1 81 1 RQ 




45 


PQlVvfA 7C1 11 fl 
toMA Zo lO 1U 


P m A 17 A \/Y"JT X5CTO\/XXDT/^l'WT\ A A17T T UVK jf/" 1 I 

KUrlAJiA V KjLr olr Vlxrltj Y YJJAvl^LliK*MtJ i 


46 


DCUA 7fil 107 


TAT! A T> O TD\ /ITDTflWH A m/T T T? 

IAHA V Oivroir V rLrlvi Y Y DA\J rLLLb | 


47 


PdV>f A 7 CO 7Q7 


T T>OTD\/TTT>T 1 

LrDJrViiJrl j 


48 


PQMA 7fifi 707 


rjT PCTPAHUPT I 
LxJLr o JLr V xlr I j 


49 


pciyfA 007 


lux I LJAl^xvL 


50 


pqma 70/; in^ 


PTOWTl A AIT T 1 


51 


P^IMA 701 700 


Q TPVTTPTrJV 

olx VxlJrlvj i I 


52 


dctv/IA 700-700 


P Q TP \7TTPTf5 V 


53 


PSMA 707-700 


Xx V rLr LKj x 1 


54 


PQMA 7Q0 1A7 


wn A nVT T T7 

I I JJAV^JvJLlvll | 


55 


PQMA/Kzt-dRI 


QCTTmXTVTT 'D\/T^r r PDT A/TVCT "\ rrjT Tfnr 1 

ooioLrlN 1 1 LrX V JJO 1 r .LiVl Y oJL V HLllvbL 


56 


roJYLA *O0-*f04 


lxiLrXN Y 1 JLK V | 


57 




oJLCVJJN Y 1 LKV j 


58 




PfiKTVTT T>\f 1 
tSvJiN I 1 LiK V 


59 




"PT 1? Wl^/^TTJT I 


60 


pc\/r a /i/rn^d/^Q 




61 


rolVlA *H3Z-*f /U 


LJvVlJL/lrlJVl 1 


62 


PCAjTA /l£l /I71 




63 


oMA 40Z-4 / 1 


r T>\rrv" v T T DT A IT\T 1 


64 


roMAOjj-Oo / 


.'i \X^' OTVTttr^ rT D A i\nWT ■» jtttt t-*t** a i"'Ti \t>t /^r nr\nnrtr ! 

t*DK&NrlVLKJVxMNl^LM | 


65 


PSMA 660-681 


VLRJvlM^QLMFLERAFro 


66         PSMA 663-671        MMNDQLMFL
67         PSMA 662-671        RMMNDQLMFL
68         PSMA 662-670        RMMNDQLMF
69         Tyr 1-17            MLLAVLYCLLWSFQTSA


Table 1B. SEQ ID NOS.* including epitopes in Examples 14 and 15. 

SEQ ID NO  IDENTITY            SEQUENCE
70         GP100 protein       **Accession number: P40967
71         MAGE-1 protein      Accession number: P43355
72         MAGE-2 protein      Accession number: P43356
73         MAGE-3 protein      Accession number: P43357
74         NY-ESO-1 protein    Accession number: P78358
75         LAGE-1a protein     Accession number: CAA11116






76 


LAiJc-lD protein 


Accession nuniDer. aa 1111/ 


77 


rKAJVLc/ protein 


Accession nuniDer. iNr uuoiuo 


78 


roA protein 


Accession nuniDer. ru/zoo 


79 


roLA protein 


Accession nuniDer. lwjOjj 


80 


UrlOU cas 


Accession nuniDer: uzuuy^ 


81 


MAGE-1 Cds 


Accession nuinDen M/ /4oi 


82 


MACjb-Z COS 


Accession numDex: jLisyzu i 


83 

W— ' 


LVlAvxb-J COS 


Accession nuniDer. Wd/jj 


84 


NY-iioU-1 ClJxNA 


Accession nuniDer. u s /h j y 


85 


rKAMri CDJN A 


Accession numDer: xNivi uuo i id 


86 


roA CDIN A 


Accession nuniDer: JNM UUio4o 


87 


PSCA cDNA 


Accession number: ArU4J4yo 


88         GP100 630-638       LPHSSSHWL
89         GP100 629-638       QLPHSSSHWL
90         GP100 614-622       LIYRRRLMK
91         GP100 613-622       SLIYRRRLMK
92         GP100 615-622       IYRRRLMK
93         GP100 630-638       LPHSSSHWL
94         GP100 629-638       QLPHSSSHWL


95         MAGE-1 95-102       ESLFRAVI
96         MAGE-1 93-102       ILESLFRAVI
97         MAGE-1 93-101       ILESLFRAV
98         MAGE-1 92-101       CILESLFRAV
99         MAGE-1 92-100       CILESLFRA
100        MAGE-1 263-271      EFLWGPRAL
101        MAGE-1 264-271      FLWGPRAL
102        MAGE-1 264-273      FLWGPRALAE
103        MAGE-1 265-274      LWGPRALAET
104        MAGE-1 268-276      PRALAETSY
105        MAGE-1 267-276      GPRALAETSY
106        MAGE-1 269-277      RALAETSYV
107        MAGE-1 271-279      LAETSYVKV
108        MAGE-1 270-279      ALAETSYVKV
109        MAGE-1 272-280      AETSYVKVL
110        MAGE-1 271-280      LAETSYVKVL
111        MAGE-1 274-282      TSYVKVLEY
112        MAGE-1 273-282      ETSYVKVLEY
113        MAGE-1 278-286      KVLEYVIKV
114        MAGE-1 168-177      SYVLVTCLGL
115        MAGE-1 169-177      YVLVTCLGL
116        MAGE-1 170-177      VLVTCLGL
117        MAGE-1 240-248      TQDLVQEKY
118        MAGE-1 239-248      LTQDLVQEKY
119        MAGE-1 232-240      YGEPRKLLT
120        MAGE-1 243-251      LVQEKYLEY
121        MAGE-1 242-251      DLVQEKYLEY
122        MAGE-1 230-238      SAYGEPRKL
123        MAGE-1 278-286      KVLEYVIKV
124        MAGE-1 277-286      VKVLEYVIKV
125        MAGE-1 276-284      YVKVLEYVI
126        MAGE-1 274-282      TSYVKVLEY
127        MAGE-1 273-282      ETSYVKVLEY
128        MAGE-1 283-291      VIKVSARVR
129        MAGE-1 282-291      YVIKVSARVR


130        MAGE-2 115-122      ELVHFLLL
131        MAGE-2 113-122      MVELVHFLLL
132        MAGE-2 109-116      ISRKMVEL
133        MAGE-2 108-116      AISRKMVEL
134        MAGE-2 107-116      AAISRKMVEL
135        MAGE-2 112-120      KMVELVHFL
136        MAGE-2 109-117      ISRKMVELV
137        MAGE-2 108-117      AISRKMVELV
138        MAGE-2 116-124      LVHFLLLKY
139        MAGE-2 115-124      ELVHFLLLKY
140        MAGE-2 111-119      RKMVELVHF
141        MAGE-2 158-166      LQLVFGIEV
142        MAGE-2 157-166      YLQLVFGIEV
143        MAGE-2 159-167      QLVFGIEVV
144        MAGE-2 158-167      LQLVFGIEVV
145        MAGE-2 164-172      IEVVEVVPI
146        MAGE-2 163-172      GIEVVEVVPI
147        MAGE-2 162-170      FGIEVVEVV
148        MAGE-2 154-162      ASEYLQLVF
149        MAGE-2 153-162      KASEYLQLVF
150        MAGE-2 218-225      EEKIWEEL
151        MAGE-2 216-225      APEEKIWEEL
152        MAGE-2 216-223      APEEKIWE
153        MAGE-2 220-228      KIWEELSML
154        MAGE-2 219-228      EKIWEELSML
155        MAGE-2 271-278      FLWGPRAL
156        MAGE-2 271-279      FLWGPRALI
157        MAGE-2 278-286      LIETSYVKV
158        MAGE-2 277-286      ALIETSYVKV
159        MAGE-2 276-284      RALIETSYV
160        MAGE-2 279-287      IETSYVKVL
161        MAGE-2 278-287      LIETSYVKVL


162        MAGE-3 271-278      FLWGPRAL
163        MAGE-3 270-278      EFLWGPRAL
164        MAGE-3 271-279      FLWGPRALV
165        MAGE-3 276-284      RALVETSYV
166        MAGE-3 272-280      LWGPRALVE
167        MAGE-3 271-280      FLWGPRALVE
168        MAGE-3 272-281      LWGPRALVET


169        NY-ESO-1 82-90      GPESRLLEF
170        NY-ESO-1 83-91      PESRLLEFY
171        NY-ESO-1 82-91      GPESRLLEFY
172        NY-ESO-1 84-92      ESRLLEFYL
173        NY-ESO-1 86-94      RLLEFYLAM
174        NY-ESO-1 88-96      LEFYLAMPF
175        NY-ESO-1 87-96      LLEFYLAMPF
176        NY-ESO-1 93-102     AMPFATPMEA
177        NY-ESO-1 94-102     MPFATPMEA
178        NY-ESO-1 115-123    PLPVPGVLL
179        NY-ESO-1 114-123    PPLPVPGVLL
180        NY-ESO-1 116-123    LPVPGVLL
181        NY-ESO-1 103-112    ELARRSLAQD
182        NY-ESO-1 118-126    VPGVLLKEF
183        NY-ESO-1 117-126    PVPGVLLKEF
184        NY-ESO-1 116-123    LPVPGVLL
185        NY-ESO-1 127-135    TVSGNILTI
186        NY-ESO-1 126-135    FTVSGNILTI
187        NY-ESO-1 120-128    GVLLKEFTV
188        NY-ESO-1 121-130    VLLKEFTVSG
189        NY-ESO-1 122-130    LLKEFTVSG
190        NY-ESO-1 118-126    VPGVLLKEF
191        NY-ESO-1 117-126    PVPGVLLKEF
192        NY-ESO-1 139-147    AADHRQLQL
193        NY-ESO-1 148-156    SISSCLQQL
194        NY-ESO-1 147-156    LSISSCLQQL
195        NY-ESO-1 138-147    TAADHRQLQL
196        NY-ESO-1 161-169    WITQCFLPV
197        NY-ESO-1 157-165    SLLMWITQC
198        NY-ESO-1 150-158    SSCLQQLSL
199        NY-ESO-1 154-162    QQLSLLMWI
200        NY-ESO-1 151-159    SCLQQLSLL
201        NY-ESO-1 150-159    SSCLQQLSLL
202        NY-ESO-1 163-171    TQCFLPVFL
203        NY-ESO-1 162-171    ITQCFLPVFL


204 


PRAMF 210.997 


PMOTjTKMTT 

L iYA J_S AJV1Y1 AAv 


90S 


PR ATVTR 218-927 


IVlA IYXV^A^XAVIYAAAv 


206 


PfcAMF 428-436 


OTTT TCtT QNT 

V^AAJ_<AVJ JLrOlN Aw 


207 


PR AMF 427-436 

ri\nivi r„ / "*tju 


T O TTT TitT QNT 

AvVcXlAwJLVJAvOi^lAv 


908 


PI? ATV/TF 420-436 


HT TOT QMT 


900 


PPAMF 431-430 


TctT QTJT TrTV 

IVJTAwOlN A-» X XI V 


910 


PP AA/TF 430-430 


T THT QNT TTTV 

ivAVJA^Oll Aj 1 XI V 


91 1 


PQA 57 61 


VT VWPflWVT 
V A-» VXIaa^ vv vl 


919 
ZIZ 


DO A <7_61 


n\n VTTPnWVT 
VJ V J-» V AAA W VL 


917 


PQA 57 60 


v_r v Jv v riA w v 


Zl*t 


PQA ^Q_67 


WVT TA ATTPT 


91 S 
Z£ J 




r VTTPnWVT TA 

Li V AAA \^ W VAvlA 


916 
ZiO 


PQA 57_67 


VT VTTPriW\/T T i 
YLV ilry W VL1 


111 
Zl / 


PQA 54-69 


T VTTPnWVT T 


i 918 


PQA fiA_77 
run. OO-/ J 


PTRNTTQVT 


91Q 


DQA 65-77 


HrPTP"MK"QVT 

LAv^AAvTN AVo Y 1 


990 


PQA 56-64 


crpOWVT TA A 

ClA V^W VivlAn. 


991 
zzi 


PQA 67 79 


A AlTPTPXnTQV 


999 

ZZZ 


PQPA 1 1 6-1 97 


r t wnpfjpir 


997 
zzj 


PQPA 1 1 5-1 97 


r t t wnpnni 




PQPA 1 14-197 




995 

ZZJ 


PQPA 00-107 


AT OPA A ATT 


226 


P.QPA 08-107 1 


HAT OPA A A IT 


997 

ZZ/ 


Tvr 1 98-177 
xyr izo-i j / 


A PT71T TM?"T7T7 A V 


99R 


Tvr 1 9Q 1 77 




990 

zzy 


Tw 1 70 1 751 

xyr iju-iJo 


TTVTWtTl? A VT 


910 


TW 171 1 78 


JsJ^AvrrAYL 


971 
ZJl 


Tvr 905-9 1 7 

iyr zuj-zi d 


P A T7T t>TI7TJT> T 
r AT Lr WrUvL 


979 
ZjZ 


Tvr 904 917 


APATTT P1X7TTDT 

ArArLr WJ1KJL 


977 


Tvr 9 1 4-997 
i yr z ih-zzo 


RT T "D\xrErVETfl 
r i-rijiv W xiv^lSlv^ 


974 


Tvr 9 1 9-990 
x yr z l z*zzu 


PT T7T T "PWPH 
ivLfi 1 IvlwJX WLy 


975 
ZjJ 


Tvr 1 01-900 
iyr ijri-zvv 


f»Q T? TWD TiTTiiH 


976 
Z30 


Tvr 1 09-900 
iyr lyz-zuv 




977 
Z J / 


Tin- 477-4R1 
iyr *t / j-h-oi 


Kl W o YV LLKjA 


238 


Tvr 476-484 


cwt T GAAA/TV 

J Yr I tl »VJXVfVlVA V 


239        Tyr 477-486         WLLGAAMVGA
240        Tyr 478-486         LLGAAMVGA
241        PSMA 4-12           LLHETDSAV
242        PSMA 13-21          ATARRPRWL
243        PSMA 53-61          TPKHNMKAF
244        PSMA 64-73          RLKAENTKKF
245        PSMA 69-77          NTKKFLH¹NF
246        PSMA 68-77          ENTKKFLH¹NF
247        PSMA 220-228        AGAKGVILY
248        PSMA 468-477        PLMYSLVHNL
249        PSMA 469-477        LMYSLVHNL
250        PSMA 463-471        RVDCTPLMY
251        PSMA 465-473        DCTPLMYSL
252        PSMA 507-515        SGMPRISKL
253        PSMA 506-515        FSGMPRISKL
254        NY-ESO-1 136-163    RLTAADHRQLQLSISSCLQQLSLLMWIT
255        NY-ESO-1 150-177    SSCLQQLSLLMWITQCFLPVFLAQPPSG

¹ This H was reported as Y in the SWISSPROT database. 



Table 1C. SEQ ID NOS.* including epitopes in Example 14. 

SEQ ID NO  IDENTITY            SEQUENCE
256        Mage-1 125-132      KAEMLESV
257        Mage-1 124-132      TKAEMLESV
258        Mage-1 123-132      VTKAEMLESV
259        Mage-1 128-136      MLESVIKNY
260        Mage-1 127-136      EMLESVIKNY
261        Mage-1 125-133      KAEMLESVI
262        Mage-1 146-153      KASESLQL
263        Mage-1 145-153      GKASESLQL
264        Mage-1 147-155      ASESLQLVF
265        Mage-1 153-161      LVFGIDVKE
266        Mage-1 114-121      LLKYRARE
267        Mage-1 106-113      VADLVGFL
268        Mage-1 105-113      KVADLVGFL
269        Mage-1 107-115      ADLVGFLLL
270        Mage-1 106-115      VADLVGFLLL
271        Mage-1 114-123      LLKYRAREPV
272        Mage-3 278-286      LVETSYVKV


273 


Mage-3 277-286 
TWAAGEGIPIFEDFVDSSVGYYTVTGLEPGIt) 
YDISVULINGGESAPTTLTQQT 
AWPFTDLRFIMGPDTMRVTW 

Accession number: X07717 



Accession number: P06731 



Accession number: NM_004363 

Accession number: P04626 

Accession number: M11730 

Accession number: Q15431 
Accession number: X95654 

Accession number: O60224 

Accession number: NM_005636 



its of the invention. Any of SEQ ID NOS. 10, 
30, 31, 45, 46, 55, 64, 65, 69, 254, and 255 can be useful as sequences containing epitopes or 
epitope clusters, as described in various embodiments of the invention. 

**All accession numbers used here and throughout can be accessed through the NCBI databases, 
for example, through the Entrez seek and retrieval system on the world wide web. 
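
The position ranges in the IDENTITY column of the tables above read as 1-based, inclusive residue numbers in the parent protein, so a shorter entry can be checked against a longer entry that contains it. The following Python sketch illustrates that reading. The helper name `subpeptide` is hypothetical, and the two parent fragments are SEQ ID NOS. 10 and 30 as listed in Table 1A.

```python
def subpeptide(parent_seq: str, parent_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive, protein numbering)
    from a parent fragment whose first residue is numbered parent_start."""
    parent_end = parent_start + len(parent_seq) - 1
    if not (parent_start <= start <= end <= parent_end):
        raise ValueError("requested range lies outside the parent fragment")
    offset = start - parent_start
    return parent_seq[offset:offset + (end - start + 1)]


# Parent fragments copied from Table 1A.
SSX2_31_68 = "YFSKEEWEKMKASEKIFYVYMKRKYEAMTKLGFKATLP"   # SEQ ID NO. 10
PSMA_163_192 = "AFSPQGMPEGDLVYVNYARTEDFFKLERDM"         # SEQ ID NO. 30

ssx2_32_40 = subpeptide(SSX2_31_68, 31, 32, 40)        # compare with SEQ ID NO. 11
psma_168_177 = subpeptide(PSMA_163_192, 163, 168, 177) # compare with SEQ ID NO. 33
```

Under this reading, the slices agree with the shorter entries listed for the same positions, which is also a convenient way to spot transcription errors in nested epitope lists.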

Note that the following discussion sets forth the inventors' understanding of the operation 
of the invention. However, it is not intended that this discussion limit the patent to any particular 
theory of operation not set forth in the claims. 

In pursuing the development of epitope vaccines, others have generated lists of predicted 
epitopes based on MHC binding motifs. Such peptides can be immunogenic, but may not 
correspond to any naturally produced antigenic fragment, so that whole antigen will not elicit a 
similar response or sensitize a target cell to cytolysis by CTL. Therefore such lists do not 
differentiate between those sequences that can be useful as vaccines and those that cannot. Efforts 
to determine which of these predicted epitopes are in fact naturally produced have often relied on 
screening their reactivity with tumor infiltrating lymphocytes (TIL). However, TIL are strongly 
biased to recognize immune epitopes whereas tumors (and chronically infected cells) will generally 
present housekeeping epitopes. Thus, unless the epitope is produced by both the housekeeping and 
immune proteasomes, the target cell will generally not be recognized by CTL induced with 
TIL-identified epitopes. The epitopes of the present invention, in contrast, are generated by the 
action of a specified proteasome, indicating that they can be naturally produced, and enabling their 
appropriate use. The importance of the distinction between housekeeping and immune epitopes to 
vaccine design is more fully set forth in PCT publication WO 01/82963A2. 

The epitopes of the invention include or encode polypeptide fragments of TAAs that are 
precursors or products of proteasomal cleavage by a housekeeping or immune proteasome, and that 
have known or predicted affinity for at least one allele of MHC I. In some embodiments, the 
epitopes include or encode a polypeptide of about 6 to 25 amino acids in length, preferably about 7 
to 20 amino acids in length, more preferably about 8 to 15 amino acids in length, and still more 
preferably 9 or 10 amino acids in length. However, it is understood that the polypeptides can be 
larger as long as they do not contain sequences that cause the polypeptides to be directed away 
from the proteasome or to be destroyed by the proteasome. For immune epitopes, if the larger 
peptides do not contain such sequences, they can be processed in the pAPC by the immune 
proteasome. Housekeeping epitopes may also be embedded in longer sequences provided that the 
sequence is adapted to facilitate liberation of the epitope's C-terminus by action of the 
immunoproteasome. The sequences of these epitopes can be subjected to computer analysis in 
order to calculate physical, biochemical, immunologic, or molecular genetic properties such as 
mass, isoelectric point, predicted mobility in electrophoresis, predicted binding to other MHC 
molecules, melting temperature of nucleic acid probes, reverse translations, similarity or homology 
to other sequences, and the like. 

In constructing the polynucleotides encoding the polypeptide epitopes of the invention, the 
gene sequence of the associated TAA can be used, or the polynucleotide can be assembled from 
any of the corresponding codons. For a 10 amino acid epitope this can constitute on the order of 
10^6 different sequences, depending on the particular amino acid composition. While large, this is a 
distinct and readily definable set representing a minuscule fraction of the >10^18 possible 
polynucleotides of this length, and thus in some embodiments, equivalents of a particular sequence 
disclosed herein encompass such distinct and readily definable variations on the listed sequence. 
In choosing a particular one of these sequences to use in a vaccine, considerations such as codon 
usage, self-complementarity, restriction sites, chemical stability, etc. can be used as will be 
apparent to one skilled in the art. 
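
The counting argument above can be illustrated with a short Python sketch. It assumes the standard genetic code (61 sense codons) and uses Tyr 207-216 (SEQ ID NO. 1) as the example; the function names are hypothetical and chosen only for this illustration.

```python
from math import prod

# Number of synonymous codons per amino acid in the standard genetic code
# (assumption: standard code, stop codons excluded; the counts sum to 61).
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def encoding_count(peptide: str) -> int:
    """Count the distinct codon sequences that encode the peptide."""
    return prod(CODON_COUNTS[residue] for residue in peptide)


def all_polynucleotides(peptide: str) -> int:
    """Count every polynucleotide of the same length (3 nucleotides per residue)."""
    return 4 ** (3 * len(peptide))


epitope = "FLPWHRLFLL"  # Tyr 207-216, SEQ ID NO. 1
n_encodings = encoding_count(epitope)     # size of the set of encoding sequences
n_total = all_polynucleotides(epitope)    # 4**30 for a 10-mer
fraction = n_encodings / n_total          # share of all 30-nucleotide sequences
```

The count varies with composition: epitopes rich in leucine, serine, or arginine (six codons each) have many more encodings than epitopes rich in methionine or tryptophan (one codon each), which is why the text gives only an order of magnitude.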

The invention contemplates producing peptide epitopes. Specifically, these epitopes are 
derived from the sequence of a TAA, and have known or predicted affinity for at least one allele of 
MHC I. Such epitopes are typically identical to those produced on target cells or pAPCs. 

Compositions Containing Active Epitopes 

Embodiments of the present invention provide polypeptide compositions, including 
vaccines, therapeutics, diagnostics, pharmacological and pharmaceutical compositions. The 
various compositions include newly identified epitopes of TAAs, as well as variants of these 
epitopes. Other embodiments of the invention provide polynucleotides encoding the polypeptide 
epitopes of the invention. The invention further provides vectors for expression of the polypeptide 
epitopes for purification. In addition, the invention provides vectors for the expression of the 
polypeptide epitopes in an APC for use as an anti-tumor vaccine. Any of the epitopes or antigens, 
or nucleic acids encoding the same, from Table 1A can be used. Other embodiments relate to 
methods of making and using the various compositions. 

A general architecture for a class I MHC-binding epitope can be described, and has been 
reviewed more extensively in Madden, D.R. Annu. Rev. Immunol. 13:587-622, 1995. Much of the 
binding energy arises from main chain contacts between conserved residues in the MHC molecule 
and the N- and C-termini of the peptide. Additional main chain contacts are made but vary among 
MHC alleles. Sequence specificity is conferred by side chain contacts of so-called anchor residues 
with pockets that, again, vary among MHC alleles. Anchor residues can be divided into primary 
and secondary. Primary anchor positions exhibit strong preferences for relatively well-defined sets 
of amino acid residues. Secondary positions show weaker and/or less well-defined preferences that 
can often be better described in terms of less favored, rather than more favored, residues. 
Additionally, residues in some secondary anchor positions are not always positioned to contact the 
pocket on the MHC molecule at all. Thus, a subset of peptides exists that bind to a particular MHC 
molecule and have a side chain-pocket contact at the position in question, and another subset exists 
that show binding to the same MHC molecule that does not depend on the conformation the 
peptide assumes in the peptide-binding groove of the MHC molecule. The C-terminal residue (PΩ) 
is preferably a primary anchor residue. For many of the better studied HLA molecules (e.g. A2, 
A68, B27, B7, B35, and B53) the second position (P2) is also an anchor residue. However, central 
anchor residues have also been observed including P3 and P5 in HLA-B8, as well as P5 and PΩ-3 
in the murine MHC molecules H-2Dᵇ and H-2Kᵇ, respectively. Since more stable binding will 
generally improve immunogenicity, anchor residues are preferably conserved or optimized in the 
design of variants, regardless of their position. 

Because the anchor residues are generally located near the ends of the epitope, the peptide 
can buckle upward out of the peptide-binding groove allowing some variation in length. Epitopes 
ranging from 8-11 amino acids have been found for HLA-A68, and up to 13 amino acids for 
HLA-A2. In addition to length variation between the anchor positions, single residue truncations 
and extensions have been reported at the N- and C-termini, respectively. Of the non-anchor 
residues, some point up out of the groove, making no contact with the MHC molecule but being 
available to contact the TCR, very often P1, P4, and PΩ-1 for HLA-A2. Others of the non-anchor 
residues can become interposed between the upper edges of the peptide-binding groove and the 
TCR, contacting both. The exact positioning of these side chain residues, and thus their effects on 
binding, MHC fine conformation, and ultimately immunogenicity, are highly sequence dependent. 
For an epitope to be highly immunogenic it must not only promote stable enough TCR binding for 
activation to occur, but the TCR must also have a high enough off-rate that multiple TCR molecules 
can interact sequentially with the same peptide-MHC complex (Kalergis, A.M. et al., Nature 
Immunol. 2:229-234, 2001). Thus without further information about the ternary complex, both 
conservative and non-conservative substitutions at these positions merit consideration when 
designing variants. 

The polypeptide epitope variants can be made, for example, using any of the techniques 
and guidelines for conservative and non-conservative mutations. Variants can be derived from 
substitution, deletion or insertion of one or more amino acids as compared with the native 
sequence. Amino acid substitutions can be the result of replacing one amino acid with another 
amino acid having similar structural and/or chemical properties, such as the replacement of a 
threonine with a serine. Such replacements are referred to as conservative amino acid 
replacements, and all appropriate conservative amino acid replacements are considered to be 
embodiments of the invention. Insertions or deletions can optionally be in the range of about 1 to 
4, preferably 1 to 2, amino acids. It is generally preferable to maintain the "anchor positions" of 
the peptide which are responsible for binding to the MHC molecule in question. Indeed, 
immunogenicity of peptides can be improved in many cases by substituting more preferred residues 
at the anchor positions (Franco, et al., Nature Immunology, 1(2):145-150, 2000). Immunogenicity 
of a peptide can also often be improved by substituting bulkier amino acids for small amino acids 
found in non-anchor positions while maintaining sufficient cross-reactivity with the original 
epitope to constitute a useful vaccine. The variation allowed can be determined by routine 
insertions, deletions or substitutions of amino acids in the sequence and testing the resulting 
variants for activity exhibited by the polypeptide epitope. Because the polypeptide epitope is often 
9 amino acids, the substitutions preferably are made to the shortest active epitope, for example, an 
epitope of 9 amino acids. 
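
As a minimal sketch of how candidate variants might be enumerated before testing, the Python below lists single conservative substitutions at non-anchor positions. Several things here are assumptions for illustration only: the grouping of residues into conservative classes (the text names only the threonine/serine pair), the choice of P2 and the C-terminal residue as the anchors to preserve, and the function names. Whether any listed variant retains activity would still have to be determined by testing, as described above.

```python
# Illustrative conservative-substitution classes (an assumption, not from the
# source text apart from the threonine/serine example).
CONSERVATIVE_GROUPS = [
    set("ST"), set("DE"), set("NQ"), set("KR"), set("ILVM"), set("FYW"),
]


def conservative_partners(residue: str) -> list[str]:
    """Return the residues grouped with `residue`, excluding the residue itself."""
    for group in CONSERVATIVE_GROUPS:
        if residue in group:
            return sorted(group - {residue})
    return []  # residue has no partner in this simplified grouping


def single_substitution_variants(epitope: str, anchor_positions: set[int]) -> list[str]:
    """List variants with one conservative substitution, leaving the
    1-based anchor positions untouched."""
    variants = []
    for index, residue in enumerate(epitope):
        if index + 1 in anchor_positions:
            continue  # maintain the anchor positions
        for partner in conservative_partners(residue):
            variants.append(epitope[:index] + partner + epitope[index + 1:])
    return variants


epitope = "FLPWHRLFL"              # Tyr 207-215, SEQ ID NO. 8
anchors = {2, len(epitope)}        # assumed anchors: P2 and the C-terminal residue
variants = single_substitution_variants(epitope, anchors)
```

Restricting the enumeration to the shortest active epitope keeps the candidate list small; a 9-mer with two preserved anchors has at most seven substitutable positions.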

Variants can also be made by adding any sequence onto the N-terminus of the polypeptide 
epitope variant. Such N-terminal additions can be from 1 amino acid up to at least 25 amino acids. 
Because peptide epitopes are often trimmed by N-terminal exopeptidases active in the pAPC, it is 
understood that variations in the added sequence can have no effect on the activity of the epitope. 
In preferred embodiments, the amino acid residues between the last upstream proteasomal cleavage 
site and the N-terminus of the MHC epitope do not include a proline residue (Serwold, T. et al., 
Nature Immunol. 2:644-651, 2001). Accordingly, effective epitopes can be generated from 
precursors larger than the preferred 9-mer class I motif. 
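
The proline condition on the N-terminal flank can be checked mechanically once a cleavage site is assumed. In the Python sketch below the function names are hypothetical and the cleavage positions are purely illustrative (the text does not state where the proteasome cuts this fragment); the precursor is PSMA 163-192 (SEQ ID NO. 30) and the epitope is PSMA 168-177 (SEQ ID NO. 33).

```python
def n_terminal_flank(precursor: str, epitope: str, first_residue_after_cleavage: int) -> str:
    """Return the residues lying between an upstream cleavage site and the
    epitope N-terminus. The cleavage site is given as the 0-based index of
    the first residue downstream of the cut."""
    start = precursor.find(epitope)
    if start < 0:
        raise ValueError("epitope not found in precursor")
    if not 0 <= first_residue_after_cleavage <= start:
        raise ValueError("cleavage site must lie upstream of the epitope")
    return precursor[first_residue_after_cleavage:start]


def flank_is_proline_free(flank: str) -> bool:
    """True when the flank contains no proline residue."""
    return "P" not in flank


precursor = "AFSPQGMPEGDLVYVNYARTEDFFKLERDM"  # PSMA 163-192, SEQ ID NO. 30
epitope = "GMPEGDLVYV"                        # PSMA 168-177, SEQ ID NO. 33

# Hypothetical cleavage positions, chosen only to show both outcomes.
flank_full = n_terminal_flank(precursor, epitope, 0)   # no upstream cut inside the fragment
flank_short = n_terminal_flank(precursor, epitope, 4)  # cut assumed just before Q167
```

With the first assumption the flank includes P166 and fails the check; with the second the flank is a single glutamine and passes.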

Peptides are useful to the extent that they correspond to epitopes actually displayed by 
MHC I on the surface of a target cell or a pAPC. A single peptide can have varying affinities for 
different MHC molecules, binding some well, others adequately, and still others not appreciably 
(Table 2). MHC alleles have traditionally been grouped according to serologic reactivity which 
does not reflect the structure of the peptide-binding groove, which can differ among different 
alleles of the same type. Similarly, binding properties can be shared across types; groups based on 
shared binding properties have been termed supertypes. There are numerous alleles of MHC I in 
the human population; epitopes specific to certain alleles can be selected based on the genotype of 
the patient. 

Table 2. 

Predicted Binding of Tyrosinase 207-216 (SEQ ID NO. 1) to Various MHC Types 

MHC I type                          *Half time of dissociation (min)
A1                                  0.05
A*0201                              1311
A*0205                              50.4
A3                                  2.7
A*1101 (part of the A3 supertype)   0.012
A24                                 6.0
B7                                  4.0
B8                                  8.0
B14 (part of the B27 supertype)     60.0
B*2702                              0.9
B*2705                              30.0
B*3501 (part of the B7 supertype)   2.0
B*4403                              0.1
B*5101 (part of the B7 supertype)   26.0
B*5102                              55.0
B*5801                              0.20
B60                                 0.40
B62                                 2.0

*HLA Peptide Binding Predictions (internet http:// access at bimas.dcrt.nih.gov/molbio/hla_bind) 
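
Selecting epitopes by patient genotype, as described above, amounts to looking up the patient's alleles in a table such as Table 2. The Python sketch below uses the predicted half-times from Table 2; the cutoff value, the example genotype, and the function name are hypothetical and serve only to illustrate the lookup.

```python
# Predicted half time of dissociation (min) for Tyr 207-216 (SEQ ID NO. 1),
# transcribed from Table 2.
PREDICTED_HALF_TIME_MIN = {
    "A1": 0.05, "A*0201": 1311.0, "A*0205": 50.4, "A3": 2.7, "A*1101": 0.012,
    "A24": 6.0, "B7": 4.0, "B8": 8.0, "B14": 60.0, "B*2702": 0.9,
    "B*2705": 30.0, "B*3501": 2.0, "B*4403": 0.1, "B*5101": 26.0,
    "B*5102": 55.0, "B*5801": 0.20, "B60": 0.40, "B62": 2.0,
}


def alleles_predicted_to_bind(patient_alleles, min_half_time):
    """Return (allele, half_time) pairs for the patient's alleles that meet
    the cutoff, strongest predicted binder first. Alleles absent from the
    table are skipped rather than guessed."""
    hits = [
        (allele, PREDICTED_HALF_TIME_MIN[allele])
        for allele in patient_alleles
        if allele in PREDICTED_HALF_TIME_MIN
        and PREDICTED_HALF_TIME_MIN[allele] >= min_half_time
    ]
    return sorted(hits, key=lambda pair: pair[1], reverse=True)


# Hypothetical genotype and cutoff, for illustration only.
patient = ["A*0201", "A3", "B7", "B*4403"]
ranked = alleles_predicted_to_bind(patient, min_half_time=3.0)
```

For this example genotype only A*0201 and B7 pass the illustrative cutoff, with A*0201 ranked first.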

In further embodiments of the invention, the epitope, as peptide or encoding 
polynucleotide, can be administered as a vaccine or immunogenic composition, alone or in 
combination with various adjuvants, carriers, or excipients. It should be noted that although the 
term vaccine may be used herein, the discussion can be applied and used with any of the other 
compositions mentioned herein. Particularly advantageous adjuvants include various cytokines and 
oligonucleotides containing immunostimulatory sequences (as set forth in greater detail in the 
co-pending applications referenced herein). Additionally the polynucleotide encoded epitope can be 
contained in a virus (e.g. vaccinia or adenovirus) or in a microbial host cell (e.g. Salmonella or 
Listeria monocytogenes) which is then used as a vector for the polynucleotide (Dietrich, G. et al. 
Nat. Biotech. 16:181-185, 1998). Alternatively a pAPC can be transformed, ex vivo, to express the 
epitope, or pulsed with peptide epitope, to be itself administered as a vaccine. To increase 
efficiency of these processes, the encoded epitope can be carried by a viral or bacterial vector, or 
complexed with a ligand of a receptor found on pAPC. Similarly the peptide epitope can be 
complexed with or conjugated to a pAPC ligand. A vaccine can be composed of more than a single 
epitope. 

Particularly advantageous strategies for incorporating epitopes, and combining them with 
epitope clusters, into a vaccine are disclosed in U.S. Patent Application No. 09/560,465 entitled 
"EPITOPE SYNCHRONIZATION IN ANTIGEN PRESENTING CELLS," filed on April 28, 
2000. Epitope clusters for use in connection with this invention are disclosed in U.S. Patent 
Application No. 09/561,571 entitled "EPITOPE CLUSTERS," filed on April 28, 2000. 

Preferred embodiments of the present invention are directed to vaccines and methods for
causing a pAPC or population of pAPCs to present housekeeping epitopes that correspond to the
epitopes displayed on a particular target cell. Any of the epitopes or antigens in Table 1A can be
used, for example. In one embodiment, the housekeeping epitope is a TuAA epitope processed by
the housekeeping proteasome of a particular tumor type. In another embodiment, the housekeeping
epitope is a virus-associated epitope processed by the housekeeping proteasome of a cell infected
with a virus. This facilitates a specific T cell response to the target cells. Concurrent expression
by the pAPCs of multiple epitopes, corresponding to different induction states (pre- and
post-attack), can drive a CTL response effective against target cells as they display either
housekeeping epitopes or immune epitopes.

By having both housekeeping and immune epitopes present on the pAPC, this embodiment
can optimize the cytotoxic T cell response to a target cell. With dual epitope expression, the
pAPCs can continue to sustain a CTL response to the immune-type epitope when the tumor cell
switches from the housekeeping proteasome to the immune proteasome with induction by IFN,
which, for example, may be produced by tumor-infiltrating CTLs.

In a preferred embodiment, immunization of a patient is with a vaccine that includes a
housekeeping epitope. Many preferred TAAs are associated exclusively with a target cell,
particularly in the case of infected cells. In another embodiment, many preferred TAAs are the
result of deregulated gene expression in transformed cells, but are found also in tissues of the testis,
ovaries and fetus. In another embodiment, useful TAAs are expressed at higher levels in the target
cell than in other cells. In still other embodiments, TAAs are not differentially expressed in the
target cell compared to other cells, but are still useful since they are involved in a particular
function of the cell and differentiate the target cell from most other peripheral cells; in such
embodiments, healthy cells also displaying the TAA may be collaterally attacked by the induced T
cell response, but such collateral damage is considered to be far preferable to the condition caused
by the target cell.

A preferred embodiment of the present invention includes a method of administering a
vaccine including a housekeeping epitope to induce a therapeutic immune response. The vaccine is
administered to a patient in a manner consistent with the standard vaccine delivery protocols that
are well known in the art. Methods of administering epitopes of TAAs include, without limitation,
transdermal, intranodal, perinodal, oral, intravenous, intradermal, intramuscular, intraperitoneal,
and mucosal administration. A particularly useful method of vaccine delivery to elicit a CTL
response is disclosed in PCT Publication No. WO 99/01283, entitled "A METHOD OF
INDUCING A CTL RESPONSE," filed on July 10, 1998.

Because the epitope synchronization system has utility in inducing a cell mediated immune
response, a vaccine to induce a specific T cell response to a target cell is likewise included in a
preferred embodiment of the present invention. The vaccine contains a housekeeping epitope in a
concentration effective to cause a pAPC or populations of pAPCs to display housekeeping
epitopes. Advantageously, the vaccine can include a plurality of housekeeping epitopes or one or
more housekeeping epitopes optionally in combination with one or more immune epitopes.
Formulations of the vaccine contain peptides and/or nucleic acids in a concentration sufficient to
cause pAPCs to present the epitopes. The formulations preferably contain epitopes in a total
concentration of about 1 µg-1 mg/100 µl of vaccine preparation. Conventional dosages and dosing
for peptide vaccines and/or nucleic acid vaccines can be used with the present invention, and such
dosing regimens are well understood in the art. In one embodiment, a single dosage for an adult
human may advantageously be from about 1 to about 5000 µl of such a composition, administered
one time or multiple times, e.g., in 2, 3, 4 or more dosages separated by 1 week, 2 weeks, 1 month,
or more (an insulin pump delivers 1 µl per hour at its lowest frequency; see the intranodal method
patent).
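
As a rough arithmetic check on the ranges stated above, the following sketch computes the epitope
mass contained in one dose from a total concentration (about 1 µg to 1 mg per 100 µl) and a dose
volume (about 1 to 5000 µl). The function name and its units convention are illustrative
assumptions and are not part of the disclosure.

```python
def epitope_mass_ug(conc_ug_per_100ul: float, dose_volume_ul: float) -> float:
    """Return the epitope mass (in µg) contained in one dose.

    conc_ug_per_100ul: total epitope concentration, in µg per 100 µl of preparation.
    dose_volume_ul:    administered volume, in µl.
    """
    if conc_ug_per_100ul < 0 or dose_volume_ul < 0:
        raise ValueError("concentration and volume must be non-negative")
    return conc_ug_per_100ul * dose_volume_ul / 100.0


# Corners of the ranges given in the text (hypothetical helper, real ranges).
lowest = epitope_mass_ug(1.0, 1.0)          # 1 µg/100 µl, 1 µl dose
highest = epitope_mass_ug(1000.0, 5000.0)   # 1 mg/100 µl, 5000 µl dose
print(f"{lowest} µg to {highest} µg per dose")
```

Under these assumptions the stated ranges span roughly 0.01 µg to 50 mg of epitope per dose, which
is why the text defers to conventional dosing practice for any particular formulation.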

The compositions and methods of the invention disclosed herein further contemplate 
incorporating adjuvants into the formulations in order to enhance the performance of the vaccines. 
Specifically, the addition of adjuvants to the formulations is designed to enhance the delivery or 
uptake of the epitopes by the pAPCs. The adjuvants contemplated by the present invention are 
known by those of skill in the art and include, for example, GMCSF, GCSF, IL-2, IL-12, BCG, 
tetanus toxoid, osteopontin, and ETA-1.

In some embodiments of the invention, the vaccines can include a recombinant organism, 
such as a virus, bacterium or parasite, genetically engineered to express an epitope in a host. For 
example, Listeria monocytogenes, a gram-positive, facultative intracellular bacterium, is a potent 
vector for targeting TuAAs to the immune system. In a preferred embodiment, this vector can be 
engineered to express a housekeeping epitope to induce therapeutic responses. The normal route of
infection of this organism is through the gut, and it can be delivered orally. In another embodiment,
an adenovirus (Ad) vector encoding a housekeeping epitope for a TuAA can be used to induce anti- 
virus or anti-tumor responses. Bone marrow-derived dendritic cells can be transduced with the 
virus construct and then injected, or the virus can be delivered directly via subcutaneous injection 
into an animal to induce potent T-cell responses. Another embodiment employs a recombinant 
vaccinia virus engineered to encode amino acid sequences corresponding to a housekeeping epitope 
for a TAA. Vaccinia viruses carrying constructs with the appropriate nucleotide substitutions in
the form of a minigene construct can direct the expression of a housekeeping epitope, leading to a 
therapeutic T cell response against the epitope. 

The immunization with DNA requires that APCs take up the DNA and express the encoded
proteins or peptides. It is possible to encode a discrete class I peptide on the DNA. By
immunizing with this construct, APCs can be caused to express a housekeeping epitope, which is
then displayed on class I MHC on the surface of the cell for stimulating an appropriate CTL
response. Constructs generally relying on termination of translation or non-proteasomal proteases
for generation of proper termini of housekeeping epitopes have been described in U.S. Patent
Application No. 09/561,572 entitled EXPRESSION VECTORS ENCODING EPITOPES OF
TARGET-ASSOCIATED ANTIGENS, filed on April 28, 2000.

It can be desirable to express housekeeping peptides in the context of a larger protein.
Processing can be detected even when a small number of amino acids are present beyond the
terminus of an epitope. Small peptide hormones are usually proteolytically processed from longer
translation products, often in the size range of approximately 60-120 amino acids. This fact has led
some to assume that this is the minimum size that can be efficiently translated. In some
embodiments, the housekeeping peptide can be embedded in a translation product of at least about
60 amino acids. In other embodiments the housekeeping peptide can be embedded in a translation
product of at least about 50, 30, or 15 amino acids.

Due to differential proteasomal processing, the immune proteasome of the pAPC produces
peptides that are different from those produced by the housekeeping proteasome in peripheral body
cells. Thus, in expressing a housekeeping peptide in the context of a larger protein, it is preferably
expressed in the APC in a context other than its full length native sequence, because, as a
housekeeping epitope, it is generally only efficiently processed from the native protein by the
housekeeping proteasome, which is not active in the APC. In order to encode the housekeeping
epitope in a DNA sequence encoding a larger protein, it is useful to find flanking areas on either
side of the sequence encoding the epitope that permit appropriate cleavage by the immune
proteasome in order to liberate that housekeeping epitope. Such a sequence ensuring epitope
synchronization is referred to hereinafter as a SYNCHROTOPE™. Altering flanking amino acid
residues at the N-terminus and C-terminus of the desired housekeeping epitope can facilitate
appropriate cleavage and generation of the housekeeping epitope in the APC. Sequences
embedding housekeeping epitopes can be designed de novo and screened to determine which can
be successfully processed by immune proteasomes to liberate housekeeping epitopes.

Alternatively, another strategy is very effective for identifying sequences allowing
production of housekeeping epitopes in APC. A contiguous sequence of amino acids can be
generated from head to tail arrangement of one or more housekeeping epitopes. A construct
expressing this sequence is used to immunize an animal, and the resulting T cell response is
evaluated to determine its specificity to one or more of the epitopes in the array. By definition,
these immune responses indicate housekeeping epitopes that are processed in the pAPC effectively.
The necessary flanking areas around this epitope are thereby defined. The use of flanking regions
of about 4-6 amino acids on either side of the desired peptide can provide the necessary
information to facilitate proteasome processing of the housekeeping epitope by the immune
proteasome. Therefore, a SYNCHROTOPE™ of approximately 16-22 amino acids can be inserted
into, or fused to, any protein sequence effectively to result in that housekeeping epitope being
produced in an APC. In alternate embodiments the whole head-to-tail array of epitopes, or just the
epitopes immediately adjacent to the correctly processed housekeeping epitope, can be similarly
transferred from a test construct to a vaccine vector.
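
The relationship between the flank length and the overall segment size described above can be
illustrated with a minimal sketch. It assumes an epitope of 8-10 residues (a typical class I
peptide length, which is what makes 4-6 flanking residues on each side give roughly 16-22 residues
in total) and uses placeholder sequences that are purely hypothetical, not epitopes from this
disclosure. The function name `flanked_segment` is likewise an illustrative assumption.

```python
def flanked_segment(array_seq: str, epitope: str, flank: int = 5) -> str:
    """Return the epitope together with up to `flank` residues on each side,
    taken from a head-to-tail array of epitopes (first occurrence only)."""
    start = array_seq.find(epitope)
    if start < 0:
        raise ValueError("epitope not found in array sequence")
    end = start + len(epitope)
    # Clamp at the ends of the array so a terminal epitope simply gets a shorter flank.
    return array_seq[max(0, start - flank):min(len(array_seq), end + flank)]


# Hypothetical placeholder 9-mers arranged head to tail.
EPITOPES = ["AAAAAAAAL", "GGGGSGGGV", "KKKKEKKKI"]
array_seq = "".join(EPITOPES)

segment = flanked_segment(array_seq, "GGGGSGGGV", flank=5)
print(segment, len(segment))  # 5 + 9 + 5 = 19 residues
```

A segment extracted this way is only a candidate; whether it is actually processed by the immune
proteasome still has to be established by screening.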

In a preferred embodiment, the housekeeping epitopes can be embedded between known
immune epitopes, or segments of such, thereby providing an appropriate context for processing.
The abutment of housekeeping and immune epitopes can generate the necessary context to enable
the immune proteasome to liberate the housekeeping epitope, or a larger fragment, preferably
including a correct C-terminus. It can be useful to screen constructs to verify that the desired
epitope is produced. The abutment of housekeeping epitopes can generate a site cleavable by the
immune proteasome. Some embodiments of the invention employ known epitopes to flank
housekeeping epitopes in test substrates; in others, screening as described below is used, whether
the flanking regions are arbitrary sequences or mutants of the natural flanking sequence, and
whether or not knowledge of proteasomal cleavage preferences is used in designing the
substrates.

Cleavage at the mature N-terminus of the epitope, while advantageous, is not required,
since a variety of N-terminal trimming activities exist in the cell that can generate the mature
N-terminus of the epitope subsequent to proteasomal processing. It is preferred that such N-terminal
extension be less than about 25 amino acids in length and it is further preferred that the extension
have few or no proline residues. Preferably, in screening, consideration is given not only to
cleavage at the ends of the epitope (or at least at its C-terminus), but consideration also can be
given to ensure limited cleavage within the epitope.
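
The preferences stated above (a correct C-terminus, an intact epitope, and an N-terminal extension
shorter than about 25 residues with few or no prolines) can be expressed as a simple filter over
observed digestion fragments. This is a sketch under stated assumptions: the function name, the
0-based half-open coordinates, the `max_prolines` threshold of zero, and the substrate sequence are
all hypothetical choices made for illustration.

```python
def fragment_is_acceptable(substrate: str, epitope: str,
                           frag_start: int, frag_end: int,
                           max_extension: int = 25, max_prolines: int = 0) -> bool:
    """Check one digestion fragment, given as substrate[frag_start:frag_end].

    Accept only fragments that end exactly at the epitope C-terminus, contain the
    whole epitope, and carry an N-terminal extension that is short and proline-poor.
    """
    ep_start = substrate.find(epitope)
    if ep_start < 0:
        raise ValueError("epitope not found in substrate")
    ep_end = ep_start + len(epitope)
    if frag_end != ep_end:        # C-terminus must be generated by the proteasome
        return False
    if frag_start > ep_start:     # fragment begins inside the epitope
        return False
    extension = substrate[frag_start:ep_start]
    return len(extension) < max_extension and extension.count("P") <= max_prolines


# Hypothetical substrate: 10 flanking residues, a placeholder 9-mer, 5 more residues.
substrate = "MKTAYIAKQRGGGGSGGGVKKKKE"
epitope = "GGGGSGGGV"
print(fragment_is_acceptable(substrate, epitope, 6, 19))   # short extension "AKQR"
print(fragment_is_acceptable(substrate, epitope, 10, 21))  # wrong C-terminus
```

A real screen would also weigh how often cleavage occurs within the epitope, which this filter does
not model.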

Shotgun approaches can be used in designing test substrates and can increase the efficiency
of screening. In one embodiment multiple epitopes can be assembled one after the other, with
individual epitopes possibly appearing more than once. The substrate can be screened to determine
which epitopes can be produced. In the case where a particular epitope is of concern, a substrate can
be designed in which it appears in multiple different contexts. When a single epitope appearing in
more than one context is liberated from the substrate, additional secondary test substrates, in which
individual instances of the epitope are removed, disabled, or are unique, can be used to determine
which are being liberated and truly constitute SYNCHROTOPE™s.
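
To make the idea of one epitope appearing in several contexts concrete, the sketch below assembles
a shotgun-style substrate and lists the flanking residues around each occurrence of a chosen
epitope. The placeholder sequences, the flank width of three residues, and the function name
`epitope_contexts` are assumptions introduced for illustration only.

```python
def epitope_contexts(substrate: str, epitope: str, flank: int = 3):
    """Return (position, left_flank, right_flank) for every occurrence of
    `epitope` in `substrate`, scanning left to right."""
    contexts = []
    pos = substrate.find(epitope)
    while pos != -1:
        end = pos + len(epitope)
        left = substrate[max(0, pos - flank):pos]
        right = substrate[end:end + flank]
        contexts.append((pos, left, right))
        pos = substrate.find(epitope, pos + 1)
    return contexts


# Hypothetical placeholder epitopes; B appears twice, each time with different neighbours.
A, B, C = "AAAAAAAAL", "GGGGSGGGV", "KKKKEKKKI"
substrate = A + B + C + B + A

for position, left, right in epitope_contexts(substrate, B):
    print(position, left, right)
```

If the epitope is liberated from such a substrate, secondary substrates that keep only one of the
listed occurrences can then show which context is responsible.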

Several readily practicable screens exist. A preferred in vitro screen utilizes proteasomal
digestion analysis, using purified immune proteasomes, to determine if the desired housekeeping
epitope can be liberated from a synthetic peptide embodying the sequence in question. The position
of the cleavages obtained can be determined by techniques such as mass spectrometry, HPLC, and
N-terminal pool sequencing, as described in greater detail in the U.S. Patent Applications entitled
METHOD OF EPITOPE DISCOVERY and EPITOPE SYNCHRONIZATION IN ANTIGEN
PRESENTING CELLS, and the two Provisional U.S. Patent Applications entitled EPITOPE
SEQUENCES, which are all cited and incorporated by reference above.

Alternatively, in vivo screens such as immunization or target sensitization can be
employed. For immunization a nucleic acid construct capable of expressing the sequence in
question is used. Harvested CTL can be tested for their ability to recognize target cells presenting
the housekeeping epitope in question. Such target cells are most readily obtained by pulsing cells
expressing the appropriate MHC molecule with synthetic peptide embodying the mature
housekeeping epitope. Alternatively, cells known to express housekeeping proteasome and the
antigen from which the housekeeping epitope is derived, either endogenously or through genetic
engineering, can be used. To use target sensitization as a screen, CTL, or preferably a CTL clone,
that recognizes the housekeeping epitope can be used. In this case it is the target cell that expresses
the embedded housekeeping epitope (instead of the pAPC during immunization) and it must
express immune proteasome. Generally, the target cell can be transformed with an appropriate
nucleic acid construct to confer expression of the embedded housekeeping epitope. Loading with a
synthetic peptide embodying the embedded epitope using peptide loaded liposomes or a protein
transfer reagent such as BIOPORTER™ (Gene Therapy Systems, San Diego, CA) represents an
alternative.

Additional guidance on nucleic acid constructs useful as vaccines in accordance with the
present invention is disclosed in U.S. Patent Application No. 09/561,572 entitled "EXPRESSION
VECTORS ENCODING EPITOPES OF TARGET-ASSOCIATED ANTIGENS," filed on April
28, 2000. Further, expression vectors and methods for their design, which are useful in accordance
with the present invention, are disclosed in U.S. Patent Application No. 60/336,968 (attorney
docket number CTLIMM.022PR) entitled "EXPRESSION VECTORS ENCODING EPITOPES
OF TARGET-ASSOCIATED ANTIGENS AND METHODS FOR THEIR DESIGN," filed on
11/7/2001, which is incorporated by reference in its entirety.

A preferred embodiment of the present invention includes a method of administering a
vaccine including an epitope (or epitopes) to induce a therapeutic immune response. The vaccine
is administered to a patient in a manner consistent with the standard vaccine delivery protocols that
are known in the art. Methods of administering epitopes of TAAs include, without limitation,
transdermal, intranodal, perinodal, oral, intravenous, intradermal, intramuscular, intraperitoneal,
and mucosal administration, including delivery by injection, instillation or inhalation. A
particularly useful method of vaccine delivery to elicit a CTL response is disclosed in Australian
Patent No. 739189 issued January 17, 2002; U.S. Patent Application No. 09/380,534, filed on
September 1, 1999; and a Continuation-in-Part thereof, U.S. Patent Application No. 09/776,232,
both entitled "A METHOD OF INDUCING A CTL RESPONSE," filed on February 2, 2001.
Reagents Recognizing Epitopes 

In another aspect of the invention, proteins with binding specificity for the epitope and the
epitope-MHC molecule complex are contemplated, as well as the isolated cells by which they can
be expressed. In one set of embodiments these reagents take the form of immunoglobulins:
polyclonal sera or monoclonal antibodies (mAb), methods for the generation of which are well
known in the art. Generation of mAb with specificity for peptide-MHC molecule complexes is
known in the art. See, for example, Aharoni et al. Nature 351:147-150, 1991; Andersen et al. Proc.
Natl. Acad. Sci. USA 93:1820-1824, 1996; Dadaglio et al. Immunity 6:727-738, 1997; Duc et al.
Int. Immunol. 5:427-431, 1993; Eastman et al. Eur. J. Immunol. 26:385-393, 1996; Engberg et al.
Immunotechnology 4:273-278, 1999; Porgador et al. Immunity 6:715-726, 1997; Puri et al. J.
Immunol. 158:2471-2476, 1997; and Polakova, K., et al. J. Immunol. 165:342-348, 2000; all of
which are hereby incorporated by reference in their entirety.

In other embodiments the compositions can be used to induce and generate, in vivo and in
vitro, T cells specific for any of the epitopes, including those listed in Table 1A, for example.
Thus, embodiments also relate to and include isolated T cells, T cell clones, T cell hybridomas, or a
protein containing the T cell receptor (TCR) binding domain derived from the cloned gene, as well
as a recombinant cell expressing such a protein. Such TCR derived proteins can be simply the
extra-cellular domains of the TCR, or a fusion with portions of another protein to confer a desired
property or function. One example of such a fusion is the attachment of TCR binding domains to
the constant regions of an antibody molecule so as to create a divalent molecule. The construction
and activity of molecules following this general pattern have been reported, for example, in Plaksin,
D. et al. J. Immunol. 158:2218-2227, 1997 and Lebowitz, M.S. et al. Cell. Immunol. 192:175-184,
1999, which are hereby incorporated by reference in their entirety. The more general construction
and use of such molecules is also treated in U.S. patent 5,830,755 entitled T CELL RECEPTORS
AND THEIR USE IN THERAPEUTIC AND DIAGNOSTIC METHODS, which is hereby
incorporated by reference in its entirety.

The generation of such T cells can be readily accomplished by standard immunization of
laboratory animals, and reactivity to human target cells can be obtained by immunizing with human
target cells or by immunizing HLA-transgenic animals with the antigen/epitope. For some
therapeutic approaches T cells derived from the same species are desirable. While such a cell can
be created by cloning, for example, a murine TCR into a human T cell as contemplated above, in
vitro immunization of human cells offers a potentially faster option. Techniques for in vitro
immunization, even using naive donors, are known in the field, for example, Stauss et al., Proc.
Natl. Acad. Sci. USA 89:7871-7875, 1992; Salgaller et al. Cancer Res. 55:4972-4979, 1995; Tsai et
al., J. Immunol. 158:1796-1802, 1997; and Chung et al., J. Immunother. 22:279-287, 1999; which
are hereby incorporated by reference in their entirety.

Any of these molecules can be conjugated to enzymes, radiochemicals, fluorescent tags, 
and toxins, so as to be used in the diagnosis (imaging or other detection), monitoring, and treatment 
of the pathogenic condition associated with the epitope. Thus a toxin conjugate can be
administered to kill tumor cells, radiolabeling can facilitate imaging of epitope-positive tumors, and
an enzyme conjugate can be used in an ELISA-like assay to diagnose cancer and confirm epitope
expression in biopsied tissue. In a further embodiment, such T cells as set forth above, following
expansion accomplished through stimulation with the epitope and/or cytokines, can be 
administered to a patient as an adoptive immunotherapy. 
Reagents Comprising Epitopes 

A further aspect of the invention provides isolated epitope-MHC complexes. In a 
particularly advantageous embodiment of this aspect of the invention, the complexes can be 
soluble, multimeric proteins such as those described in U. S. Patent No. 5,635,363 (tetramers) or U. 
S. Patent No. 6,015,884 (Ig-dimers), both of which are hereby incorporated by reference in their 
entirety. Such reagents are useful in detecting and monitoring specific T cell responses, and in 
purifying such T cells. 

Isolated MHC molecules complexed with epitopic peptides can also be incorporated into 
planar lipid bilayers or liposomes. Such compositions can be used to stimulate T cells in vitro or, in 
the case of liposomes, in vivo. Co-stimulatory molecules (e.g. B7, CD40, LFA-3) can be 
incorporated into the same compositions or, especially for in vitro work, co-stimulation can be 
provided by anti-co-receptor antibodies (e.g. anti-CD28, anti-CD154, anti-CD2) or cytokines (e.g.
IL-2, IL-12). Such stimulation of T cells can constitute vaccination, drive expansion of T cells in
vitro for subsequent infusion in an immunotherapy, or constitute a step in an assay of T cell
function. 

The epitope, or more directly its complex with an MHC molecule, can be an important
constituent of functional assays of antigen-specific T cells at either an activation or readout step or
both. Of the many assays of T cell function current in the art (detailed procedures can be found in
standard immunological references such as Current Protocols in Immunology 1999 John Wiley &
Sons Inc., N.Y., which is hereby incorporated by reference in its entirety) two broad classes can be
defined: those that measure the response of a pool of cells and those that measure the response of
individual cells. Whereas the former conveys a global measure of the strength of a response, the
latter allows determination of the relative frequency of responding cells. Examples of assays
measuring global response are cytotoxicity assays, ELISA detecting cytokine secretion, and
proliferation assays. Assays measuring the responses of individual cells (or small clones derived
from them) include limiting dilution analysis (LDA), ELISPOT, flow cytometric detection of
unsecreted cytokine (described in U.S. Patent No. 5,445,939, entitled "METHOD FOR
ASSESSMENT OF THE MONONUCLEAR LEUKOCYTE IMMUNE SYSTEM," and U.S. Patent
Nos. 5,656,446 and 5,843,689, both entitled "METHOD FOR THE ASSESSMENT OF THE
MONONUCLEAR LEUKOCYTE IMMUNE SYSTEM," reagents for which are sold by Becton,
Dickinson & Company under the tradename 'FASTIMMUNE', which patents are hereby
incorporated by reference in their entirety), and detection of specific TCR with tetramers or
Ig-dimers as stated and referenced above. The comparative virtues of these techniques have been
reviewed in Yee, C. et al. Current Opinion in Immunology, 13:141-146, 2001, which is hereby
incorporated by reference in its entirety. Additionally, detection of a specific TCR rearrangement
or expression can be accomplished through a variety of established nucleic acid based techniques,
particularly in situ and single-cell PCR techniques, as will be apparent to one of skill in the art.

These functional assays are used to assess endogenous levels of immunity, response to an
immunologic stimulus (e.g. a vaccine), and to monitor immune status through the course of a
disease and treatment. Except when measuring endogenous levels of immunity, any of these assays
presume a preliminary step of immunization, whether in vivo or in vitro depending on the nature of
the issue being addressed. Such immunization can be carried out with the various embodiments of
the invention described above or with other forms of immunogen (e.g., pAPC-tumor cell fusions)
that can provoke similar immunity. With the exception of PCR and tetramer/Ig-dimer type
analyses which can detect expression of the cognate TCR, these assays generally benefit from a
step of in vitro antigenic stimulation which can advantageously use various embodiments of the
invention as described above in order to detect the particular functional activity (highly cytolytic
responses can sometimes be detected directly). Finally, detection of cytolytic activity requires
epitope-displaying target cells, which can be generated using various embodiments of the
invention. The particular embodiment chosen for any particular step depends on the question to be
addressed, ease of use, cost, and the like, but the advantages of one embodiment over another for
any particular set of circumstances will be apparent to one of skill in the art.
Tumor Associated Antigens 

Epitopes of the present invention are derived from the TuAAs tyrosinase (SEQ ID NO. 2),
SSX-2 (SEQ ID NO. 3), PSMA (prostate-specific membrane antigen) (SEQ ID NO. 4), GP100
(SEQ ID NO. 70), MAGE-1 (SEQ ID NO. 71), MAGE-2 (SEQ ID NO. 72), MAGE-3 (SEQ ID
NO. 73), NY-ESO-1 (SEQ ID NO. 74), PRAME (SEQ ID NO. 77), PSA (SEQ ID NO. 78), and
PSCA (SEQ ID NO. 79). The natural coding sequences for these eleven proteins, or any segments
within them, can be determined from their cDNA or complete coding (cds) sequences, SEQ ID
NOS. 5-7, and 80-87, respectively.

Tyrosinase is a melanin biosynthetic enzyme that is considered one of the most specific
markers of melanocyte differentiation. Tyrosinase is expressed in few cell types, primarily in
melanocytes, and high levels are often found in melanomas. The usefulness of tyrosinase as a
TuAA is taught in U.S. Patent 5,747,271 entitled "METHOD FOR IDENTIFYING INDIVIDUALS
SUFFERING FROM A CELLULAR ABNORMALITY SOME OF WHOSE ABNORMAL
CELLS PRESENT COMPLEXES OF HLA-A2/TYROSINASE DERIVED PEPTIDES, AND
METHODS FOR TREATING SAID INDIVIDUALS" which is hereby incorporated by reference
in its entirety.

GP100, also known as PMel17, also is a melanin biosynthetic protein expressed at high
levels in melanomas. GP100 as a TuAA is disclosed in U.S. Patent 5,844,075 entitled
"MELANOMA ANTIGENS AND THEIR USE IN DIAGNOSTIC AND THERAPEUTIC
METHODS," which is hereby incorporated by reference in its entirety.

SSX-2, also known as Hom-Mel-40, is a member of a family of highly conserved cancer-
testis antigens (Gure, A.O. et al. Int. J. Cancer 72:965-971, 1997, which is hereby incorporated by
reference in its entirety). Its identification as a TuAA is taught in U.S. Patent 6,025,191 entitled
"ISOLATED NUCLEIC ACID MOLECULES WHICH ENCODE A MELANOMA SPECIFIC
ANTIGEN AND USES THEREOF," which is hereby incorporated by reference in its entirety.
Cancer-testis antigens are found in a variety of tumors, but are generally absent from normal adult
tissues except testis. Expression of different members of the SSX family has been found
variously in tumor cell lines. Due to the high degree of sequence identity among SSX family
members, similar epitopes from more than one member of the family will be generated and able to
bind to an MHC molecule, so that some vaccines directed against one member of this family can
cross-react and be effective against other members of this family (see example 3 below).

MAGE-1, MAGE-2, and MAGE-3 are members of another family of cancer-testis antigens
originally discovered in melanoma (MAGE is a contraction of melanoma-associated antigen) but
found in a variety of tumors. The identification of MAGE proteins as TuAAs is taught in U.S.
Patent 5,342,774 entitled NUCLEOTIDE SEQUENCE ENCODING THE TUMOR REJECTION
ANTIGEN PRECURSOR, MAGE-1, which is hereby incorporated by reference in its entirety, and
in numerous subsequent patents. Currently there are 17 entries for (human) MAGE in the SWISS
Protein database. There is extensive similarity among these proteins, so in many cases an epitope
from one can induce a cross-reactive response to other members of the family. A few of these have
not been observed in tumors, most notably MAGE-H1 and MAGE-D1, which are expressed in
testes and brain, and bone marrow stromal cells, respectively. The possibility of cross-reactivity on
normal tissue is ameliorated by the fact that they are among the least similar to the other MAGE
proteins.

NY-ESO-1 is a cancer-testis antigen found in a wide variety of tumors, also known as
CTAG-1 (Cancer-Testis Antigen-1) and CAG-3 (Cancer Antigen-3). NY-ESO-1 as a TuAA is
disclosed in U.S. Patent 5,804,381 entitled ISOLATED NUCLEIC ACID MOLECULE
ENCODING AN ESOPHAGEAL CANCER ASSOCIATED ANTIGEN, THE ANTIGEN ITSELF,
AND USES THEREOF which is hereby incorporated by reference in its entirety. A paralogous
locus encoding antigens with extensive sequence identity, LAGE-1a/s (SEQ ID NO. 75) and
LAGE-1b/L (SEQ ID NO. 76), has been disclosed in publicly available assemblies of the human
genome, and these antigens have been concluded to arise through alternate splicing. Additionally,
CT-2 (or CTAG-2, Cancer-Testis Antigen-2) appears to be either an allele, a mutant, or a sequencing
discrepancy of LAGE-1b/L. Due to the extensive sequence identity, many epitopes from NY-ESO-1
can also induce immunity to tumors expressing these other antigens. See figure 1. The proteins
are virtually identical through amino acid 70. From 71-134 the longest run of identities between
NY-ESO-1 and LAGE is 6 residues, but potentially cross-reactive sequences are present. And
from 135-180 NY-ESO-1 and LAGE-1a/s are identical except for a single residue, but LAGE-1b/L is
unrelated due to the alternate splice. The CAMEL and LAGE-2 antigens appear to derive from the
LAGE-1 mRNA, but from alternate reading frames, thus giving rise to unrelated protein sequences.
More recently, GenBank Accession AF277315.5, Homo sapiens chromosome X clone RP5-
865E18, RP5-1087L19, complete sequence, reports three independent loci in this region which are
labeled as LAGE1 (corresponding to CTAG-2 in the genome assemblies), plus LAGE2-A and
LAGE2-B (both corresponding to CTAG-1 in the genome assemblies).
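
The "longest run of identities" used above to compare related antigens can be computed directly
from a pairwise alignment. The sketch below assumes the two sequences are already aligned to equal
length, with `-` marking gaps; the toy sequences are hypothetical and are not the NY-ESO-1 or LAGE
sequences, and the function name is an illustrative assumption.

```python
def longest_identity_run(aligned_a: str, aligned_b: str) -> int:
    """Return the length of the longest stretch of consecutive identical
    residues between two pre-aligned, equal-length sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    best = run = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == res_b and res_a != "-":
            run += 1
            best = max(best, run)
        else:
            run = 0  # a mismatch or gap column ends the current run
    return best


# Hypothetical toy alignment: two runs of four identities separated by one mismatch.
seq_a = "ACDEFGHIKL"
seq_b = "ACDEYGHIKV"
print(longest_identity_run(seq_a, seq_b))
```

A short run does not by itself rule out cross-reactivity, since peptides with conservative
substitutions can still be recognized, which is consistent with the text's note that potentially
cross-reactive sequences remain in the less conserved region.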

PSMA (prostate-specific membrane antigen), a TuAA described in U.S. Patent 5,538,866
entitled "PROSTATE-SPECIFIC MEMBRANE ANTIGEN" which is hereby incorporated by
reference in its entirety, is expressed by normal prostate epithelium and, at a higher level, in
prostatic cancer. It has also been found in the neovasculature of non-prostatic tumors. PSMA can
thus form the basis for vaccines directed to both prostate cancer and to the neovasculature of other
tumors. This latter concept is more fully described in a provisional U.S. Patent application No.
60/274,063 entitled "ANTI-NEOVASCULAR VACCINES FOR CANCER," filed March 7, 2001,
and U.S. Application No. / attorney docket number CTLIMM.015A, filed on March 7,
2002, entitled "ANTI-NEOVASCULAR PREPARATIONS FOR CANCER," both of which are
hereby incorporated by reference in their entirety. Alternate splicing of the PSMA mRNA also
leads to a protein with an apparent start at Met58, thereby deleting the putative membrane anchor
region of PSMA as described in U.S. Patent 5,935,818 entitled "ISOLATED NUCLEIC ACID
MOLECULE ENCODING ALTERNATIVELY SPLICED PROSTATE-SPECIFIC
MEMBRANE ANTIGEN AND USES THEREOF" which is hereby incorporated by reference in
its entirety. A protein termed PSMA-like protein, Genbank accession number AF261715, is nearly
identical to amino acids 309-750 of PSMA and has a different expression profile. Thus the most
preferred epitopes are those with an N-terminus located from amino acid 58 to 308.

PRAME, also known as MAPE, DAGE, and OIP4, was originally observed as a melanoma
antigen. Subsequently, it has been recognized as a CT antigen, but unlike many CT antigens (e.g.,
MAGE, GAGE, and BAGE) it is expressed in acute myeloid leukemias. PRAME is a member of
the MAPE family which consists largely of hypothetical proteins with which it shares limited
sequence similarity. The usefulness of PRAME as a TuAA is taught in U.S. Patent 5,830,753
entitled "ISOLATED NUCLEIC ACID MOLECULES CODING FOR TUMOR REJECTION
ANTIGEN PRECURSOR DAGE AND USES THEREOF" which is hereby incorporated by
reference in its entirety.

PSA, prostate specific antigen, is a peptidase of the kallikrein family and a differentiation
antigen of the prostate. Expression in breast tissue has also been reported. Alternate names include
gamma-seminoprotein, kallikrein 3, seminogelase, seminin, and P-30 antigen. PSA has a high
degree of sequence identity with the various alternate splicing products prostatic/glandular
kallikrein-1 and -2, as well as kallikrein 4, which is also expressed in prostate and breast tissue.
Other kallikreins generally share less sequence identity and have different expression profiles.
Nonetheless, cross-reactivity that might be provoked by any particular epitope, along with the
likelihood that that epitope would be liberated by processing in non-target tissues (most generally
by the housekeeping proteasome), should be considered in designing a vaccine.

PSCA, prostate stem cell antigen, also known as SCAH-2, is a differentiation antigen
preferentially expressed in prostate epithelial cells, and overexpressed in prostate cancers. Lower
level expression is seen in some normal tissues including neuroendocrine cells of the digestive tract
and collecting ducts of the kidney. PSCA is described in U.S. Patent 5,856,136 entitled "HUMAN
STEM CELL ANTIGENS" which is hereby incorporated by reference in its entirety.

Synaptonemal complex protein 1 (SCP-1), also known as HOM-TES-14, is a meiosis-associated
protein and also a cancer-testis antigen (Tureci, O., et al. Proc. Natl. Acad. Sci. USA
95:5211-5216, 1998). As a cancer antigen its expression is not cell-cycle regulated and it is found
frequently in gliomas, breast, renal cell, and ovarian carcinomas. It has some similarity to myosins,
but with few enough identities that cross-reactive epitopes are not an immediate prospect.

The ED-B domain of fibronectin is also a potential target. Fibronectin is subject to
developmentally regulated alternative splicing, with the ED-B domain being encoded by a single
exon that is used primarily in oncofetal tissues (Matsuura, H. and S. Hakomori Proc. Natl. Acad.
Sci. USA 82:6517-6521, 1985; Carnemolla, B. et al. J. Cell Biol. 108:1139-1148, 1989; Loridon-Rosa,
B. et al. Cancer Res. 50:1608-1612, 1990; Nicolo, G. et al. Cell Differ. Dev. 32:401-408,
1990; Borsi, L. et al. Exp. Cell Res. 199:98-105, 1992; Oyama, F. et al. Cancer Res. 53:2005-2011,
1993; Mandel, U. et al. APMIS 102:695-702, 1994; Farnoud, M.R. et al. Int. J. Cancer 61:27-34,
1995; Pujuguet, P. et al. Am. J. Pathol. 148:579-592, 1996; Gabler, U. et al. Heart 75:358-362,
1996; Chevalier, X. Br. J. Rheumatol. 35:407-415, 1996; Midulla, M. Cancer Res. 60:164-169,
2000).

The ED-B domain is also expressed in fibronectin of the neovasculature (Kaczmarek, J. et
al. Int. J. Cancer 59:11-16, 1994; Castellani, P. et al. Int. J. Cancer 59:612-618, 1994; Neri, D. et
al. Nat. Biotech. 15:1271-1275, 1997; Karelina, T.V. and A.Z. Eisen Cancer Detect. Prev. 22:438-444,
1998; Tarli, L. et al. Blood 94:192-198, 1999; Castellani, P. et al. Acta Neurochir. (Wien)
142:277-282, 2000). As an oncofetal domain, the ED-B domain is commonly found in the
fibronectin expressed by neoplastic cells in addition to being expressed by the neovasculature.
Thus, CTL-inducing vaccines targeting the ED-B domain can exhibit two mechanisms of action:
direct lysis of tumor cells, and disruption of the tumor's blood supply through destruction of the
tumor-associated neovasculature. As CTL activity can decay rapidly after withdrawal of vaccine,
interference with normal angiogenesis can be minimal. The design and testing of vaccines targeted
to neovasculature is described in Provisional U.S. Patent Application No. 60/274,063 entitled
"ANTI-NEOVASCULATURE VACCINES FOR CANCER" and in U.S. Patent Application No.
_/ , attorney docket number CTLIMM.015A, entitled "ANTI-NEOVASCULATURE
PREPARATIONS FOR CANCER," filed on date even with this application (March 7, 2002). A
tumor cell line is disclosed in Provisional U.S. Application No. _/ , filed on March 7, 2002,
attorney docket number CTLIMM.028PR, entitled "HLA-TRANSGENIC MURINE TUMOR
CELL LINE," which is hereby incorporated by reference in its entirety.

Carcinoembryonic antigen (CEA) is a paradigmatic oncofetal protein first described in
1965 (Gold and Freedman, J. Exp. Med. 121:439-462, 1965. Fuller references can be found in the
Online Mendelian Inheritance in Man; record *114890). It has officially been renamed
carcinoembryonic antigen-related cell adhesion molecule 5 (CEACAM5). Its expression is most
strongly associated with adenocarcinomas of the epithelial lining of the digestive tract and in fetal
colon. CEA is a member of the immunoglobulin supergene family and the defining member of the
CEA subfamily.

HER2/NEU is an oncogene related to the epidermal growth factor receptor (van de Vijver,
et al., New Eng. J. Med. 319:1239-1245, 1988), and apparently identical to the C-ERBB2 oncogene
(Di Fiore, et al., Science 237:178-182, 1987). The over-expression of ERBB2 has been implicated
in the neoplastic transformation of prostate cancer. As HER2 it is amplified and over-expressed in
25-30% of breast cancers among other tumors where expression level is correlated with the
aggressiveness of the tumor (Slamon, et al., New Eng. J. Med. 344:783-792, 2001). A more detailed
description is available in the Online Mendelian Inheritance in Man; record *164870.

All references mentioned herein are hereby incorporated by reference in their entirety.
Further, incorporated by reference in its entirety is U.S. Patent Application No. 10/005,905
(attorney docket number CTLIMM.021CP1) entitled "EPITOPE SYNCHRONIZATION IN
ANTIGEN PRESENTING CELLS," filed on November 7, 2001 and a continuation thereof, U.S.
Application No. _/ , filed on December 7, 2000, attorney docket number
CTLIMM.21CP1C, also entitled "EPITOPE SYNCHRONIZATION IN ANTIGEN PRESENTING
CELLS."

Useful epitopes were identified and tested as described in the following examples.
However, these examples are intended for illustration purposes only, and should not be construed
as limiting the scope of the invention in any way.

EXAMPLES 

Sequences of Specific Preferred Epitopes 

Example 1

Manufacture of tyrosinase epitopes.

A. Synthetic production of epitopes 

Peptides having an amino acid sequence of any of SEQ ID NO: 1, 8, 9, 11-23, 26-29, 32-44,
47-54, 56-63, 66-68, 88-253, or 256-588 are synthesized using either FMOC or tBOC solid phase
synthesis methodologies. After synthesis, the peptides are cleaved from their supports with either
trifluoroacetic acid or hydrogen fluoride, respectively, in the presence of appropriate protective
scavengers. After removing the acid by evaporation, the peptides are extracted with ether to
remove the scavengers and the crude, precipitated peptide is then lyophilized. Purity of the crude
peptides is determined by HPLC, sequence analysis, amino acid analysis, counterion content
analysis and other suitable means. If the crude peptides are pure enough (greater than or equal to
about 90% pure), they can be used as is. If purification is required to meet drug substance
specifications, the peptides are purified using one or a combination of the following:
re-precipitation; reverse-phase, ion exchange, size exclusion or hydrophobic interaction
chromatography; or counter-current distribution.
Drug product formulation 

GMP-grade peptides are formulated in a parenterally acceptable aqueous, organic, or
aqueous-organic buffer or solvent system in which they remain both physically and chemically
stable and biologically potent. Generally, buffers or combinations of buffers or combinations of
buffers and organic solvents are appropriate. The pH range is typically between 6 and 9. Organic
modifiers or other excipients can be added to help solubilize and stabilize the peptides. These
include detergents, lipids, co-solvents, antioxidants, chelators and reducing agents. In the case of a
lyophilized product, sucrose or mannitol or other lyophilization aids can be added. Peptide
solutions are sterilized by membrane filtration into their final container-closure system and either
lyophilized for dissolution in the clinic, or stored until use.

B. Construction of expression vectors for use as nucleic acid vaccines 

The construction of three generic epitope expression vectors is presented below. The
particular advantages of these designs are set forth in U.S. Patent Application No. 09/561,572
entitled "EXPRESSION VECTORS ENCODING EPITOPES OF TARGET-ASSOCIATED
ANTIGENS," which has been incorporated by reference in its entirety above.

A suitable E. coli strain was then transformed with the plasmid and plated out onto a
selective medium. Several colonies were grown up in suspension culture and positive clones were
identified by restriction mapping. The positive clone was then grown up and aliquoted into
storage vials and stored at -70°C.

A mini-prep (QIAprep Spin Mini-prep: Qiagen, Valencia, CA) of the plasmid was then
made from a sample of these cells and automated fluorescent dideoxy sequence analysis was used
to confirm that the construct had the desired sequence.

B.1 Construction of pVAX-EP1-IRES-EP2

Overview: 

The starting plasmid for this construct is pVAX1 purchased from Invitrogen (Carlsbad,
CA). Epitopes EP1 and EP2 were synthesized by GIBCO BRL (Rockville, MD). The IRES was
excised from pIRES purchased from Clontech (Palo Alto, CA).

Procedure:

1 pIRES was digested with EcoRI and NotI. The digested fragments were separated by
agarose gel electrophoresis, and the IRES fragment was purified from the excised band.

2 pVAX1 was digested with EcoRI and NotI, and the pVAX1 fragment was gel-purified.

3 The purified pVAX1 and IRES fragments were then ligated together.

4 Competent E. coli of strain DH5α were transformed with the ligation mixture.

5 Minipreps were made from 4 of the resultant colonies.

6 Restriction enzyme digestion analysis was performed on the miniprep DNA. One
recombinant colony having the IRES insert was used for further insertion of EP1 and EP2.
This intermediate construct was called pVAX-IRES.

7 Oligonucleotides encoding EP1 and EP2 were synthesized.

8 EP1 was subcloned into pVAX-IRES between AflII and EcoRI sites, to make
pVAX-EP1-IRES.

9 EP2 was subcloned into pVAX-EP1-IRES between SalI and NotI sites, to make the
final construct pVAX-EP1-IRES-EP2.

10 The sequence of the EP1-IRES-EP2 insert was confirmed by DNA sequencing.

B.2 Construction of pVAX-EP1-IRES-EP2-ISS-NIS

Overview:

The starting plasmid for this construct was pVAX-EP1-IRES-EP2 (Example 1). The ISS
(immunostimulatory sequence) introduced into this construct is AACGTT, and the NIS (standing
for nuclear import sequence) used is the SV40 72bp repeat sequence. ISS-NIS was synthesized by
GIBCO BRL. See Figure 2.

Procedure:

1 pVAX-EP1-IRES-EP2 was digested with NruI; the linearized plasmid was gel-purified.

2 ISS-NIS oligonucleotide was synthesized.

3 The purified linearized pVAX-EP1-IRES-EP2 and synthesized ISS-NIS were ligated
together.

4 Competent E. coli of strain DH5α were transformed with the ligation product.

5 Minipreps were made from resultant colonies.

6 Restriction enzyme digestions of the minipreps were carried out.

7 The plasmid with the insert was sequenced.
B.3 Construction of pVAX-EP2-UB-EP1

Overview:

The starting plasmid for this construct was pVAX1 (Invitrogen). EP2 and EP1 were
synthesized by GIBCO BRL. Wild type Ubiquitin cDNA encoding the 76 amino acids in the
construct was cloned from yeast.

Procedure:

1 RT-PCR was performed using yeast mRNA. Primers were designed to amplify the
complete coding sequence of yeast Ubiquitin.

2 The RT-PCR products were analyzed using agarose gel electrophoresis. A band with the
predicted size was gel-purified.

3 The purified DNA band was subcloned into pZERO1 at the EcoRV site. The resulting clone
was named pZERO-UB.

4 Several clones of pZERO-UB were sequenced to confirm the Ubiquitin sequence before
further manipulations.

5 EP1 and EP2 were synthesized.

6 EP2, Ubiquitin and EP1 were ligated and the insert cloned into pVAX1 between BamHI
and EcoRI, putting it under control of the CMV promoter.

7 The sequence of the insert EP2-UB-EP1 was confirmed by DNA sequencing.
Example 2

Identification of useful epitope variants.

The 10-mer FLPWHRLFLL (SEQ ID NO. 1) is identified as a useful epitope. Based on
this sequence, numerous variants are made. Variants exhibiting activity in HLA binding assays (see
Example 3, section 6) are identified as useful, and are subsequently incorporated into vaccines.

The HLA-A2 binding of length variants of FLPWHRLFLL has been evaluated.
Proteasomal digestion analysis indicates that the C-terminus of the 9-mer FLPWHRLFL (SEQ ID
NO. 8) is also produced. Additionally the 9-mer LPWHRLFLL (SEQ ID NO. 9) can result from
N-terminal trimming of the 10-mer. Both are predicted to bind to the HLA-A*0201 molecule;
however, of these two 9-mers, FLPWHRLFL displayed more significant binding and is preferred
(see Figs. 3A and B).

Sequence variants of FLPWHRLFL are constructed as follows. Consistent with the binding
coefficient table (see Table 3) from the NIH/BIMAS MHC binding prediction program (see
reference in Example 3 below), binding can be improved by changing the L at position 9, an anchor
position, to V. Binding can also be altered, though generally to a lesser extent, by changes at
non-anchor positions. Referring generally to Table 3, binding can be increased by employing residues
with relatively larger coefficients. Changes in sequence can also alter immunogenicity
independently of their effect on binding to MHC. Thus binding and/or immunogenicity can be
improved as follows:

By substituting F, L, M, W, or Y for P at position 3; these are all bulkier residues that can
also improve immunogenicity independent of the effect on binding. The amine and
hydroxyl-bearing residues, Q and N; and S and T; respectively, can also provoke a stronger,
cross-reactive response.

By substituting D or E for W at position 4 to improve binding; this addition of a negative
charge can also make the epitope more immunogenic, while in some cases reducing cross-reactivity
with the natural epitope. Alternatively the conservative substitutions of F or Y can provoke a
cross-reactive response.

By substituting F for H at position 5 to improve binding. H can be viewed as partially
charged, thus in some cases the loss of charge can hinder cross-reactivity. Substitution of the fully
charged residues R or K at this position can enhance immunogenicity without disrupting
charge-dependent cross-reactivity.

By substituting I, L, M, V, F, W, or Y for R at position 6. The same caveats and
alternatives apply here as at position 5.

By substituting W or F for L at position 7 to improve binding. Substitution of V, I, S, T, Q,
or N at this position is not generally predicted to reduce binding affinity by this model (the NIH
algorithm), yet can be advantageous as discussed above.

Y and W, which are equally preferred as the Fs at positions 1 and 8, can provoke a useful
cross-reactivity. Finally, while substitutions in the direction of bulkiness are generally favored to
improve immunogenicity, the substitution of smaller residues such as A, S, and C, at positions 3-7
can be useful according to the theory that contrast in size, rather than bulkiness per se, is an
important factor in immunogenicity. The reactivity of the thiol group in C can introduce other
properties as discussed in Chen, J.-L., et al. J. Immunol. 165:948-955, 2000.

Table 3. 9-mer Coefficient Table for HLA-A*0201*

HLA Coefficient table for file "A_0201_standard"

*This table and other comparable data that are publicly available are useful in designing epitope
variants and in determining whether a particular variant is substantially similar, or is functionally
similar.
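
The binding-oriented substitutions listed in Example 2 can be enumerated mechanically. The following is a minimal sketch, not part of the disclosed method: it builds every single-residue variant of FLPWHRLFL (SEQ ID NO. 8) using only the substitutions the text names as improving binding. The dictionary layout and the function name are illustrative assumptions.

```python
# Parent 9-mer from Example 2 (SEQ ID NO. 8).
PARENT = "FLPWHRLFL"

# Binding-oriented substitutions named in the text, keyed by 1-based position.
# (The layout of this table is an assumption made for illustration.)
SUBSTITUTIONS = {
    3: "FLMWY",    # for P at position 3
    4: "DE",       # for W at position 4
    5: "F",        # for H at position 5
    6: "ILMVFWY",  # for R at position 6
    7: "WF",       # for L at position 7
    9: "V",        # for L at anchor position 9
}


def single_substitution_variants(parent, substitutions):
    """Return (position, new_residue, variant) for each single substitution."""
    variants = []
    for position, residues in sorted(substitutions.items()):
        for residue in residues:
            if parent[position - 1] == residue:
                continue  # skip no-op substitutions
            variant = parent[:position - 1] + residue + parent[position:]
            variants.append((position, residue, variant))
    return variants


VARIANTS = single_substitution_variants(PARENT, SUBSTITUTIONS)
```

Each variant produced this way would still need to be scored against a coefficient table such as Table 3 and confirmed in a binding assay.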
Example 3

Cluster Analysis (SSX-2 31-68).

1. Epitope cluster region prediction:

The computer algorithms: SYFPEITHI (internet http:// access at
syfpeithi.bmi-heidelberg.com/Scripts/M...), based on the book "MHC Ligands and
Peptide Motifs" by H.G. Rammensee, J. Bachmann and S. Stevanovic; and HLA Peptide Binding
Predictions (NIH) (internet http:// access at bimas.dcrt.nih.gov/molbio/hla_bind), described in
Parker, K. C., et al., J. Immunol. 152:163, 1994; were used to analyze the protein sequence of
SSX-2 (GI: 10337583). Epitope clusters (regions with higher than average density of peptide
fragments with high predicted MHC affinity) were defined as described fully in U.S. Patent
Application No. 09/561,571 entitled "EPITOPE CLUSTERS," filed on April 28, 2000. Using an
epitope density ratio cutoff of 2, five and two clusters were defined using the SYFPEITHI and NIH
algorithms, respectively, and peptide score cutoffs of 16 (SYFPEITHI) and 5 (NIH). The highest
scoring peptide with the NIH algorithm, SSX-2 41-49, with an estimated halftime of dissociation of
>1000 min., does not overlap any other predicted epitope but does cluster with SSX-2 57-65 in the
NIH analysis.
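
The density ratio criterion can be illustrated with a short calculation. This is only a sketch under stated assumptions: the full cluster definition is given in the referenced "EPITOPE CLUSTERS" application, and here the ratio is taken simply as the epitope density of a candidate region divided by the epitope density of the whole protein. The function names, the inclusive comparison against the cutoff, and the example counts are hypothetical.

```python
def epitope_density_ratio(n_in_region, region_length, n_in_protein, protein_length):
    """Density of predicted epitopes in a region relative to the whole protein."""
    region_density = n_in_region / region_length
    protein_density = n_in_protein / protein_length
    return region_density / protein_density


def is_cluster(n_in_region, region_length, n_in_protein, protein_length, cutoff=2.0):
    """Apply a density ratio cutoff (2 in this example); inclusive by assumption."""
    ratio = epitope_density_ratio(n_in_region, region_length,
                                  n_in_protein, protein_length)
    return ratio >= cutoff


# Hypothetical numbers: 4 predicted epitopes in a 25-residue region of a
# 200-residue protein that has 10 predicted epitopes overall.
EXAMPLE_RATIO = epitope_density_ratio(4, 25, 10, 200)
```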

2. Peptide synthesis and characterization:

SSX-2 31-68, YFSKEEWEKMKASEKIFYVYMKRKYEAM (SEQ ID NO.
10) was synthesized by MPS (Multiple Peptide Systems, San Diego, CA 92121) using standard
solid phase chemistry. According to the provided 'Certificate of Analysis', the purity of this
peptide was 95%.

3. Proteasome digestion:

Proteasome was isolated from human red blood cells using the proteasome isolation
protocol described in U.S. Patent Application No. 09/561,074 entitled "METHOD OF EPITOPE
DISCOVERY," filed on April 28, 2000. SDS-PAGE, western-blotting, and ELISA were used as
quality control assays. The final concentration of proteasome was 4 mg/ml, which was determined
by non-interfering protein assay (Geno Technologies Inc.). Proteasomes were stored at -70°C in 25
µL aliquots.

SSX-2 31-68 was dissolved in Milli-Q water, and a 2 mM stock solution prepared and 20 µL
aliquots stored at -20°C.

One tube of proteasome (25 µL) was removed from storage at -70°C and thawed on ice. It was
then mixed thoroughly with 12.5 µL of 2 mM peptide by repipetting (samples were kept on ice). A
5 µL sample was immediately removed after mixing and transferred to a tube containing 1.25 µL
10% TFA (final concentration of TFA was 2%); the T=0 min sample. The proteasome digestion
reaction was then started and carried out at 37°C in a programmable thermal controller. Additional
5 µL samples were taken out at 15, 30, 60, 120, 180 and 240 min respectively; the reaction was
stopped by adding the sample to 1.25 µL 10% TFA as before. Samples were kept on ice or frozen
until being analyzed by MALDI-MS. All samples were saved and stored at -20°C for HPLC
analysis and N-terminal sequencing. Peptide alone (without proteasome) was used as a blank
control: 2 µL peptide + 4 µL Tris buffer (20 mM, pH 7.6) + 1.5 µL TFA.
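
The concentrations quoted above follow from simple dilution arithmetic. The sketch below checks the stated 2% final TFA concentration and, as a derived illustration that the text does not state explicitly, the peptide and proteasome concentrations in the 37.5 µL digestion mix. The function name is an illustrative assumption.

```python
def final_concentration(stock_concentration, stock_volume, total_volume):
    """C1*V1 = C2*V2, solved for the final concentration C2."""
    return stock_concentration * stock_volume / total_volume


# 1.25 uL of 10% TFA plus a 5 uL sample.
TFA_PERCENT = final_concentration(10.0, 1.25, 5.0 + 1.25)

# 12.5 uL of 2 mM peptide mixed with 25 uL of proteasome.
PEPTIDE_MM = final_concentration(2.0, 12.5, 25.0 + 12.5)

# 25 uL of 4 mg/ml proteasome in the same 37.5 uL mix.
PROTEASOME_MG_PER_ML = final_concentration(4.0, 25.0, 25.0 + 12.5)
```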

4. MALDI-TOF MS measurements:

For each time point 0.3 µL of matrix solution (10 mg/ml α-cyano-4-hydroxycinnamic acid
in AcCN/H2O (70:30)) was first applied on a sample slide, and then an equal volume of digested
sample was mixed gently with matrix solution on the slide. The slide was allowed to dry at
ambient air for 3-5 min. before acquiring the mass spectra. MS was performed on a Lasermat 2000
MALDI-TOF mass spectrometer that was calibrated with peptide/protein standards. To improve
the accuracy of measurement, the molecular ion weight (MH+) of the peptide substrate was used as
an internal calibration standard. The mass spectrum of the T=120 min. digested sample is shown in
Figure 4.

5. MS data analysis and epitope identification:

To assign the measured mass peaks, the computer program MS-Product, a tool from the
UCSF Mass Spectrometry Facility (http:// accessible at
prospector.ucsf.edu/ucsfhtml3.4/msprod.htm), was used to generate all possible fragments (N- and
C-terminal ions, and internal fragments) and their corresponding molecular weights. Due to the
sensitivity of the mass spectrometer, average molecular weight was used. The mass peaks
observed over the course of the digestion were identified as summarized in Table 4.

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by
the SYFPEITHI or NIH algorithms were chosen for further study. The digestion and prediction
steps of the procedure can be usefully practiced in any order. Although the substrate peptide used
in the proteasomal digest described here was specifically designed to include predicted HLA-A2.1
binding sequences, the actual products of digestion can be checked after the fact for actual or
predicted binding to other MHC molecules. Selected results are shown in Table 5.

Table 4. SSX-2 31-68 Mass Peak Identification.




Boldface sequences correspond to peptides predicted to bind to MHC.

* On the basis of mass alone this peak could also have been assigned to the peptide 32-50, however
proteasomal removal of just the N-terminal amino acid is unlikely. N-terminal sequencing (below)
verifies the assignment to 31-49.

** On the basis of mass this fragment might also represent 33-68. N-terminal sequencing below is
consistent with the assignment to 31-65.

Table 5. Predicted HLA binding by proteasomally generated fragments



SEQ ID NO.   PEPTIDE           HLA       SYFPEITHI   NIH

11           FSKEEWEKM         B*3501    NP†         90
12           KMKASEKIF         B*08      17          <5
13 & (14)    (K)MKASEKIFY      A1        19 (19)     <5
15 & (16)    (M)KASEKIFYV      A*0201    22 (16)     1017
                               B*08      17          <5
                               B*5101    22 (13)     60
                               B*5102    NP          133
                               B*5103    NP          121
17 & (18)    (K)ASEKIFYVY      A1        34 (19)     14
19 & (20)    (K)RKYEAMTKL      A*0201    15          <5
                               A26       15          NP
                               B14       NP          45 (60)
                               B*2705    21
                               B*2709    16          NP
                               B*5101    15          <5
21           KYEAMTKLGF        A1        16          <5
                               A24       NP          300
22           YEAMTKLGF         B*4403    NP          80
23           EAMTKLGF          B*08      22          <5

†No prediction

As seen in Table 5, N-terminal addition of authentic sequence to epitopes can generate epitopes for
the same or different MHC restriction elements. Note in particular the pairing of
(K)RKYEAMTKL (SEQ ID NOS 19 and (20)) with HLA-B14, where the 10-mer has a longer
predicted halftime of dissociation than the co-C-terminal 9-mer. Also note the case of the 10-mer
KYEAMTKLGF (SEQ ID NO. 21) which can be used as a vaccine useful with several MHC types
by relying on N-terminal trimming to create the epitopes for HLA-B*4403 and -B*08.
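
The N-terminal trimming relationship among SEQ ID NOS 21-23 can be shown with a short helper. This is an illustrative sketch only: the function name and the 8-residue lower bound (taken from the 8-10 amino acid range mentioned in section 5) are assumptions.

```python
def co_c_terminal_fragments(peptide, min_length=8):
    """List fragments that share the peptide's C-terminus, longest first.

    Each successive fragment is the previous one with a single N-terminal
    residue removed, mimicking stepwise N-terminal trimming.
    """
    return [peptide[start:] for start in range(len(peptide) - min_length + 1)]


# The 10-mer of SEQ ID NO. 21 and its trimmed 9-mer and 8-mer.
TRIMMED = co_c_terminal_fragments("KYEAMTKLGF")
```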
6. HLA-A0201 binding assay:

Binding of the candidate epitope KASEKIFYV, SSX-2 41-49, (SEQ ID NO. 15) to HLA-A2.1
was assayed using a modification of the method of Stauss et al., (Proc Natl Acad Sci USA
89(17):7871-5 (1992)). Specifically, T2 cells, which express empty or unstable MHC molecules
on their surface, were washed twice with Iscove's modified Dulbecco's medium (IMDM) and
cultured overnight in serum-free AIM-V medium (Life Technologies, Inc., Rockville, MD)
supplemented with human β2-microglobulin at 3 µg/ml (Sigma, St. Louis, MO) and added peptide,
at 800, 400, 200, 100, 50, 25, 12.5, and 6.25 µg/ml, in a 96-well flat-bottom plate at 3x10^5 cells/200
µl/well. Peptide was mixed with the cells by repipetting before distributing to the plate
(alternatively peptide can be added to individual wells), and the plate was rocked gently for 2
minutes. Incubation was in a 5% CO2 incubator at 37°C. The next day the unbound peptide was
removed by washing twice with serum-free RPMI medium and a saturating amount of anti-class I
HLA monoclonal antibody, fluorescein isothiocyanate (FITC)-conjugated anti-HLA A2, A28 (One
Lambda, Canoga Park, CA) was added. After incubation for 30 minutes at 4°C, cells were washed
3 times with PBS supplemented with 0.5% BSA, 0.05% (w/v) sodium azide, pH 7.4-7.6 (staining
buffer). (Alternatively W6/32 (Sigma) can be used as the anti-class I HLA monoclonal antibody,
the cells washed with staining buffer and then incubated with fluorescein isothiocyanate
(FITC)-conjugated goat F(ab') antimouse-IgG (Sigma) for 30 min at 4°C and washed 3 times as before.)
The cells were resuspended in 0.5 ml staining buffer. The analysis of surface HLA-A2.1 molecules
stabilized by peptide binding was performed by flow cytometry using a FACScan (Becton
Dickinson, San Jose, CA). If flow cytometry is not to be performed immediately the cells can be
fixed by adding a quarter volume of 2% paraformaldehyde and storing in the dark at 4°C.
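
The peptide concentrations used in the binding assay form a two-fold dilution series. As a minimal sketch, with function and variable names chosen only for illustration, the series and the cell density implied by the stated plating of 3x10^5 cells in 200 µl per well can be reproduced as follows.

```python
def twofold_series(top_concentration, n_points):
    """Concentrations of a two-fold serial dilution, highest first."""
    return [top_concentration / 2 ** step for step in range(n_points)]


# Eight points starting at 800 ug/ml, as listed in the binding assay.
PEPTIDE_UG_PER_ML = twofold_series(800.0, 8)

# Cell density implied by 3x10^5 cells in 200 ul (0.2 ml) per well.
CELLS_PER_ML = 3e5 / 0.2
```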

The results of the experiment are shown in Figure 5. SSX-2 41-49 (SEQ ID NO. 15) was
found to bind HLA-A2.1 to a similar extent as the known A2.1 binder FLPSDYFPSV (HBV 18-27;
SEQ ID NO: 24) used as a positive control. An HLA-B44 binding peptide, AEMGKYSFY (SEQ
ID NO: 25), was used as a negative control. The fluorescence obtained from the negative control
was similar to the signal obtained when no peptide was used in the assay. Positive and negative
control peptides were chosen from Table 18.3.1 in Current Protocols in Immunology p. 18.3.2,
John Wiley and Sons, New York, 1998.

7. Immunogenicity:

A. In vivo immunization of mice.

HHD1 transgenic A*0201 mice (Pascolo, S., et al. J. Exp. Med. 185:2043-2051, 1997)
were anesthetized and injected subcutaneously at the base of the tail, avoiding lateral tail veins,
using 100 µl containing 100 nmol of SSX-2 41-49 (SEQ ID NO. 15) and 20 µg of HTL epitope
peptide in PBS emulsified with 50 µl of IFA (incomplete Freund's adjuvant).

B. Preparation of stimulating cells (LPS blasts).

Using spleens from 2 naive mice for each group of immunized mice, un-immunized mice
were sacrificed and the carcasses were placed in alcohol. Using sterile instruments, the top dermal
layer of skin on the mouse's left side (lower mid-section) was cut through, exposing the
peritoneum. The peritoneum was saturated with alcohol, and the spleen was aseptically extracted.
The spleen was placed in a petri dish with serum-free media. Splenocytes were isolated by using
sterile plungers from 3 ml syringes to mash the spleens. Cells were collected in 50 ml conical
tubes in serum-free media, rinsing the dish well. Cells were centrifuged (12000 rpm, 7 min) and
washed one time with RPMI. Fresh spleen cells were resuspended to a concentration of 1x10^6
cells per ml in RPMI-10% FCS (fetal calf serum). 25 µg/ml lipopolysaccharide and 7 µg/ml Dextran
Sulfate were added. Cells were incubated for 3 days in T-75 flasks at 37°C, with 5% CO2. Splenic
blasts were collected in 50 ml tubes, pelleted (12000 rpm, 7 min) and resuspended to 3x10^7/ml in
RPMI. The blasts were pulsed with the priming peptide at 50 µg/ml, RT, 4 hr, mitomycin C-treated
at 25 µg/ml, 37°C, 20 min, and washed three times with DMEM.

C. In vitro stimulation.

3 days after LPS stimulation of the blast cells and the same day as peptide loading, the
primed mice were sacrificed (at 14 days post immunization) to remove spleens as above. 3x10^6
splenocytes were co-cultured with 1x10^6 LPS blasts/well in 24-well plates at 37°C, with 5% CO2
in DMEM media supplemented with 10% FCS, 5x10^-5 M β-mercaptoethanol, 100 µg/ml
streptomycin and 100 IU/ml penicillin. Cultures were fed 5% (vol/vol) ConA supernatant on day 3
and assayed for cytolytic activity on day 7 in a 51Cr-release assay.

D. Chromium-release assay measuring CTL activity.

To assess peptide specific lysis, 2x10^6 T2 cells were incubated with 100 µCi sodium
chromate together with 50 µg/ml peptide at 37°C for 1 hour. During incubation they were
gently shaken every 15 minutes. After labeling and loading, cells were washed three times with 10
ml of DMEM-10% FCS, wiping each tube with a fresh Kimwipe after pouring off the supernatant.
Target cells were resuspended in DMEM-10% FBS at 1x10^5/ml. Effector cells were adjusted to
1x10^7/ml in DMEM-10% FCS and 100 µl serial 3-fold dilutions of effectors were prepared in
U-bottom 96-well plates. 100 µl of target cells were added per well. In order to determine
spontaneous release and maximum release, six additional wells containing 100 µl of target cells
were prepared for each target. Spontaneous release was revealed by incubating the target cells with
100 µl medium; maximum release was revealed by incubating the target cells with 100 µl of 2%
SDS. Plates were then centrifuged for 5 min at 600 rpm and incubated for 4 hours at 37°C in 5%
CO2 and 80% humidity. After the incubation, plates were then centrifuged for 5 min at 1200 rpm.
Supernatants were harvested and counted using a gamma counter. Specific lysis was determined as
follows: % specific release = [(experimental release - spontaneous release)/ (maximum release -
spontaneous release)] x 100.
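
The specific release formula translates directly into code. The sketch below is illustrative only: the function name is an assumption, the guard against a non-positive denominator is an added safeguard, and the counts in the example are hypothetical values rather than data from the experiments described here.

```python
def percent_specific_release(experimental, spontaneous, maximum):
    """% specific release = (experimental - spontaneous) / (maximum - spontaneous) x 100."""
    if maximum <= spontaneous:
        raise ValueError("maximum release must exceed spontaneous release")
    return 100.0 * (experimental - spontaneous) / (maximum - spontaneous)


# Hypothetical gamma counter readings (counts per minute).
EXAMPLE_LYSIS = percent_specific_release(experimental=1500.0,
                                         spontaneous=500.0,
                                         maximum=2500.0)
```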

Results of the chromium release assay demonstrating specific lysis of peptide pulsed target
cells are shown in Figure 6.

8. Cross-reactivity with other SSX proteins:

SSX-2 41-49 (SEQ ID NO. 15) shares a high degree of sequence identity with the same region
of the other SSX proteins. The surrounding regions have also been generally well conserved. Thus
the housekeeping proteasome can cleave following V49 in all five sequences. Moreover, SSX 41-49 is
predicted to bind HLA-A*0201 (see Table 6). CTL generated by immunization with SSX-2 41-49
cross-react with tumor cells expressing other SSX proteins.
Table 6. SSX 41-49 - A*0201 Predicted Binding

SEQ ID NO.   Family Member   Sequence     SYFPEITHI Score   NIH Score

15           SSX-2           KASEKIFYV    22                1017
26           SSX-1           KYSEKISYV    18                1.7
27           SSX-3           KVSEKIVYV    24                1105
28           SSX-4           KSSEKIVYV    20                82
29           SSX-5           KASEKIIYV    22                175



Example 4

Cluster Analysis (PSMA 163-192).

A peptide, AFSPQGMPEGDLVYVNYARTEDFFKLERDM, PSMA 163-192, (SEQ ID NO.
30), containing an A1 epitope cluster from prostate specific membrane antigen, PSMA 168-190 (SEQ
ID NO. 31) was synthesized using standard solid-phase Fmoc chemistry on a 433A ABI Peptide
synthesizer. After side chain deprotection and cleavage from the resin, the peptide, first dissolved in
formic acid and then diluted into 30% acetic acid, was run on a reverse-phase preparative HPLC
C4 column under the following conditions: linear AB gradient (5% B/min) at a flow rate of 4 ml/min,
where eluent A is 0.1% aqueous TFA and eluent B is 0.1% TFA in acetonitrile. A fraction at time
16.642 min containing the expected peptide, as judged by mass spectrometry, was pooled and
lyophilized. The peptide was then subjected to proteasome digestion and mass spectrum analysis
essentially as described above. Prominent peaks from the mass spectra are summarized in Table 7.

Table 7. PSMA163-192 Mass Peak Identification.

PEPTIDE    SEQUENCE                  CALCULATED MASS (MH+)
163-177    AFSPQGMPEGDLVYV           1610.0
178-189    NYARTEDFFKLE              1533.68
170-189    PEGDLVYVNYARTEDFFKLE      2406.66
178-191    NYARTEDFFKLERD            1804.95
170-191    PEGDLVYVNYARTEDFFKLERD    2677.93
178-192    NYARTEDFFKLERDM           1936.17
163-176    AFSPQGMPEGDLVY            1511.70
177-192    VNYARTEDFFKLERDM          2035.30
163-179    AFSPQGMPEGDLVYVNY         1888.12
180-192    ARTEDFFKLERDM             1658.89
163-183    AFSPQGMPEGDLVYVNYARTE     2345.61
184-192    DFFKLERDM                 1201.40
176-192    YVNYARTEDFFKLERDM         2198.48
167-185    QGMPEGDLVYVNYARTEDF       2205.41
178-186    NYARTEDFF                 1163.22
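
The fragment labels in Table 7 are residue ranges within the 30-residue substrate, numbered from position 163 of PSMA. As a minimal sketch (the helper name `fragment` is hypothetical), a range can be mapped back to its sequence as follows, which is a convenient way to cross-check a table entry against the substrate:

```python
SUBSTRATE = "AFSPQGMPEGDLVYVNYARTEDFFKLERDM"  # PSMA163-192 (SEQ ID NO. 30)
FIRST_RESIDUE = 163  # PSMA numbering of the first residue of the substrate


def fragment(start, end, substrate=SUBSTRATE, first=FIRST_RESIDUE):
    """Return the sequence spanning residues start..end (inclusive, PSMA numbering)."""
    last = first + len(substrate) - 1
    if not (first <= start <= end <= last):
        raise ValueError("range lies outside the substrate")
    return substrate[start - first:end - first + 1]


print(fragment(163, 177))
print(fragment(178, 186))
```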






N-terminal Pool Sequence Analysis

One aliquot at one hour of the proteasomal digestion (see Example 3 part 3 above) was
subjected to N-terminal amino acid sequence analysis by an ABI 473A Protein Sequencer (Applied
Biosystems, Foster City, CA). Determination of the sites and efficiencies of cleavage was based on
consideration of the sequence cycle, the repetitive yield of the protein sequencer, and the relative
yields of amino acids unique in the analyzed sequence. That is, if the unique (in the analyzed
sequence) residue X appears only in the nth cycle, a cleavage site exists n-1 residues before it in the
N-terminal direction. In addition to helping resolve any ambiguity in the assignment of mass to
sequences, these data also provide a more reliable indication of the relative yield of the various
fragments than does mass spectrometry.
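
The cycle rule just stated can be sketched in Python. This is an illustration under stated assumptions, not the procedure actually used: the function name is hypothetical, and the returned value `first - 1` (here 162) is simply this sketch's way of flagging the uncleaved N-terminus of the substrate.

```python
SUBSTRATE = "AFSPQGMPEGDLVYVNYARTEDFFKLERDM"  # PSMA163-192 (SEQ ID NO. 30)
FIRST_RESIDUE = 163


def cleavage_candidates(substrate, first, residue, cycle):
    """List the positions after which a cleavage would put `residue` at Edman cycle `cycle`.

    A fragment whose N-terminus is n-1 residues before the residue shows that
    residue at cycle n. A returned value of first-1 means the intact N-terminus.
    """
    sites = []
    for index, amino_acid in enumerate(substrate):
        if amino_acid != residue:
            continue
        position = first + index
        n_terminus = position - (cycle - 1)
        if n_terminus >= first:
            sites.append(n_terminus - 1)  # cleavage occurs after this residue
    return sites


print(cleavage_candidates(SUBSTRATE, FIRST_RESIDUE, "N", 1))   # unique residue N
print(cleavage_candidates(SUBSTRATE, FIRST_RESIDUE, "L", 5))   # residue occurring twice
```

For a residue that occurs more than once in the substrate the function returns every consistent site, which is why such signals need the mass data or other cycles to be resolved.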

For PSMA163-192 (SEQ ID NO. 30) this pool sequencing supports a single major cleavage
site after V177 and several minor cleavage sites, particularly one after Y179. Reviewing the results
presented in figures 7A-C reveals the following:

S at the 3rd cycle indicating presence of the N-terminus of the substrate.

Q at the 5th cycle indicating presence of the N-terminus of the substrate.

N at the 1st cycle indicating cleavage after V177.

N at the 3rd cycle indicating cleavage after V175. Note the fragment 176-192 in Table 7.

T at the 5th cycle indicating cleavage after V177.

T at the 1st-3rd cycles, indicating increasingly common cleavages after R181, A180 and Y179.
Only the last of these corresponds to peaks detected by mass spectrometry; 163-179 and
180-192, see Table 7. The absence of the others can indicate that they are on
fragments smaller than were examined in the mass spectrum.

K at the 4th, 8th, and 10th cycles indicating cleavages after E183, Y179, and V177, respectively,
all of which correspond to fragments observed by mass spectroscopy. See Table 7.

A at the 1st and 3rd cycles indicating presence of the N-terminus of the substrate and
cleavage after V177, respectively.

P at the 4th and 8th cycles indicating presence of the N-terminus of the substrate.

G at the 6th and 10th cycles indicating presence of the N-terminus of the substrate.

M at the 7th cycle indicating presence of the N-terminus of the substrate and/or cleavage
after F185.

M at the 15th cycle indicating cleavage after V177.
The 1st cycle can indicate cleavage after D191, see Table 7.

R at the 4th and 13th cycle indicating cleavage after V177.

R at the 2nd and 11th cycle indicating cleavage after Y179.

V at the 2nd, 6th, and 13th cycle indicating cleavage after V175, M169 and presence of the
N-terminus of the substrate, respectively. Note fragments beginning at 176 and 170 in
Table 7.

Y at the 1st, 2nd, and 14th cycles indicating cleavage after V175, V177, and presence of the
N-terminus of the substrate, respectively.

L at the 11th and 12th cycles indicating cleavage after V177, and presence of the N-terminus
of the substrate, respectively, is the interpretation most consistent with the other data.
Comparing to the mass spectrometry results we see that L at the 2nd, 5th, and 9th cycles
is consistent with cleavage after F186, E183 or M169, and Y179, respectively. See Table 7.

Epitope Identification 

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by
the SYFPEITHI or NIH algorithms were chosen for further analysis. The digestion and prediction
steps of the procedure can be usefully practiced in any order. Although the substrate peptide used
in the proteasomal digest described here was specifically designed to include a predicted HLA-A1
binding sequence, the actual products of digestion can be checked after the fact for actual or
predicted binding to other MHC molecules. Selected results are shown in Table 8.
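
As a minimal sketch of the selection step, the 8-, 9- and 10-residue sequences sharing a C-terminus with an observed cleavage site can be enumerated as below. The function name is hypothetical, and scoring the candidates with SYFPEITHI or the NIH algorithm is outside the scope of this sketch.

```python
SUBSTRATE = "AFSPQGMPEGDLVYVNYARTEDFFKLERDM"  # PSMA163-192 (SEQ ID NO. 30)
FIRST_RESIDUE = 163


def co_c_terminal(substrate, first, c_terminus, lengths=(8, 9, 10)):
    """Map each length to the sequence of that length ending at residue `c_terminus`."""
    end = c_terminus - first + 1  # exclusive slice index of the C-terminus
    if not 0 < end <= len(substrate):
        raise ValueError("C-terminus lies outside the substrate")
    # Lengths that would start before the substrate N-terminus are skipped.
    return {n: substrate[end - n:end] for n in lengths if end - n >= 0}


# Candidates ending at the major cleavage site after V177.
print(co_c_terminal(SUBSTRATE, FIRST_RESIDUE, 177))
```

The 9-mer and 10-mer produced for the V177 site are the sequences tested in the HLA-A*0201 binding assay of this example.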
Table 8. Predicted HLA binding by proteasomally generated fragments

SEQ ID NO.   PEPTIDE         HLA        SYFPEITHI          NIH
32 & (33)    (G)MPEGDLVYV    A*0201     17 ([illegible])   (2605)
                             B*0702     20                 <5
                             B*5101     11                 314
34 & (35)    (Q)GMPEGDLVY    A1         24 (26)            <5
                             A3         16 (18)            36
                             B*2705     17                 25
36           MPEGDLVY        B*5101     [illegible]        NP†
37 & (38)    (P)EGDLVYVNY    A1         [illegible]        [illegible]
                             A26        23 (17)            NP
39           LVYVNYARTE      A3         21                 [illegible]
40 & (41)    (Y)VNYARTEDF    A26        (20)               NP
                             B*08       15                 <5
                             B*2705     12                 50
42           NYARTEDFF       A24        NP†                100
                             Cw*0401    NP                 120
43           YARTEDFF        B*08       16                 <5
44           RTEDFFKLE       A1         21                 <5
                             A26        15                 NP

†No prediction

HLA-A*0201 binding assay: 

HLA-A*0201 binding studies were performed with PSMA168-177, GMPEGDLVYV, (SEQ
ID NO. 33) essentially as described in Example 3 above. As seen in figure 8, this epitope exhibits
significant binding at even lower concentrations than the positive control peptides. The Melan-A
peptide used as a control in this assay (and throughout this disclosure), ELAGIGILTV, is actually a
variant of the natural sequence (EAAGIGILTV) and exhibits a high affinity in this assay.



Example 5 

Cluster Analysis (PSMA281-310).

Another peptide, RGIAEAVGLPSIPVHPIGYYDAQKLLEKMG, PSMA281-310, (SEQ ID
NO. 45), containing an A1 epitope cluster from prostate specific membrane antigen, PSMA283-307
(SEQ ID NO. 46), was synthesized using standard solid-phase F-moc chemistry on a 433A ABI
Peptide synthesizer. After side chain deprotection and cleavage from the resin, the peptide in ddH2O
was run on a reverse-phase preparative HPLC C18 column under the following conditions: linear AB
gradient (5% B/min) at a flow rate of 4 ml/min, where eluent A is 0.1% aqueous TFA and eluent B
is 0.1% TFA in acetonitrile. A fraction at time 17.061 min containing the expected peptide, as
judged by mass spectrometry, was pooled and lyophilized. The peptide was then subjected to
proteasome digestion and mass spectrum analysis essentially as described above. Prominent peaks
from the mass spectra are summarized in Table 9.
Table 9. PSMA281-310 Mass Peak Identification.

PEPTIDE    SEQUENCE                                              CALCULATED MASS
281-297    RGIAEAVGLPSIPVHPI*                                    1727.07
286-297    AVGLPSIPVHPI**                                        1200.46
287-297    VGLPSIPVHPI                                           1129.38
288-297    GLPSIPVHPI*                                           1030.25
298-310    GYYDAQKLLEKMG                                         1516.5
298-305    GYYDAQKL                                              958.05
281-305    RGIAEAVGLPSIPVHPIGYYDAQKL [footnote mark illegible]   2666.12
281-307    RGIAEAVGLPSIPVHPIGYYDAQKLLE                           2908.39
286-307    AVGLPSIPVHPIGYYDAQKLLE [footnote mark illegible]      2381.78
287-307    VGLPSIPVHPIGYYDAQKLLE                                 2310.70
288-307    GLPSIPVHPIGYYDAQKLLE#                                 2211.57
281-299    RGIAEAVGLPSIPVHPIGY                                   1947
286-299    AVGLPSIPVHPIGY                                        1420.69
287-299    VGLPSIPVHPIGY                                         1349.61
288-299    GLPSIPVHPIGY                                          1250.48
287-310    VGLPSIPVHPIGYYDAQKLLEKMG                              2627.14
288-310    GLPSIPVHPIGYYDAQKLLEKMG                               2528.01


Boldface sequences correspond to peptides predicted to bind to MHC, see Table 10.
*By mass alone this peak could also have been 296-310 or 288-303.

**By mass alone this peak could also have been 298-307. Combination of HPLC and mass
spectrometry shows that at some later time points this peak is a mixture of both species.
| By mass alone this peak could also have been 289-298.
By mass alone this peak could also have been 281-295 or 294-306.
§ By mass alone this peak could also have been 297-303.
t By mass alone this peak could also have been 285-306.
# By mass alone this peak could also have been 288-303.

None of these alternate assignments are supported by N-terminal pool sequence analysis.

N-terminal Pool Sequence Analysis 

One aliquot at one hour of the proteasomal digestion (see Example 3 part 3 above) was 
subjected to N-terminal amino acid sequence analysis by an ABI 473A Protein Sequencer (Applied 
Biosystems, Foster City, CA). Determination of the sites and efficiencies of cleavage was based on 
consideration of the sequence cycle, the repetitive yield of the protein sequencer, and the relative 
yields of amino acids unique in the analyzed sequence. That is if the unique (in the analyzed 
sequence) residue X appears only in the nth cycle a cleavage site exists n-1 residues before it in the 
N-terminal direction. In addition to helping resolve any ambiguity in the assignment of mass to
sequences, these data also provide a more reliable indication of the relative yield of the various
fragments than does mass spectrometry.

For PSMA281-310 (SEQ ID NO. 45) this pool sequencing supports two major cleavage sites
after V287 and I297 among other minor cleavage sites. Reviewing the results presented in Fig. 9
reveals the following:

S at the 4th and 11th cycles indicating cleavage after V287 and presence of the N-terminus of
the substrate, respectively.

H at the 8th cycle indicating cleavage after V287. The lack of decay in peak height at
positions 9 and 10 versus the drop in height present going from 10 to 11 can suggest
cleavage after A286 and E285 as well, rather than the peaks representing latency in the
sequencing reaction.

D at the 2nd, 4th, and 7th cycles indicating cleavages after Y299, I297, and V294, respectively.
This last cleavage is not observed in any of the fragments in Table 10 or in the
alternate assignments in the notes below.

Q at the 6th cycle indicating cleavage after I297.

M at the 10th and 12th cycle indicating cleavages after Y299 and I297, respectively.

Epitope Identification 

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by
the SYFPEITHI or NIH algorithms were chosen for further study. The digestion and prediction
steps of the procedure can be usefully practiced in any order. Although the substrate peptide used 
in proteasomal digest described here was specifically designed to include a predicted HLA-A1 
binding sequence, the actual products of digestion can be checked after the fact for actual or 
predicted binding to other MHC molecules. Selected results are shown in Table 10. 
Table 10. Predicted HLA binding by proteasomally generated fragments: PSMA281-310

SEQ ID NO.   PEPTIDE         HLA          SYFPEITHI     NIH
47 & (48)    (G)LPSIPVHPI    A*0201       16 (24)       (24)
                             B*0702/B7    [illegible]   19
                             B*5101       [illegible]   [illegible]
                             Cw*0401      NP†           70
49 & (50)    (P)IGYYDAQKL    A*0201       (16)          <5
                             A26          (20)          NP
                             B*2705       16            [illegible]
                             B*2709       15            NP
                             B*5101       21            57
                             Cw*0301      NP            24
51 & (52)    (P)SIPVHPIGY    A1           21 (27)       <5
                             A26          22            NP
                             A3           16            <5
53           IPVHPIGY        B*5101       16            NP
54           YYDAQKLLE       A1           22            <5

†No prediction




As seen in Table 10, N-terminal addition of authentic sequence to epitopes can often generate still
useful, even better epitopes, for the same or different MHC restriction elements. Note for example
the pairing of (G)LPSIPVHPI with HLA-A*0201, where the 10-mer can be used as a vaccine
useful with several MHC types by relying on N-terminal trimming to create the epitopes for
HLA-B7, -B*5101, and Cw*0401.

HLA-A*0201 binding assay:

HLA-A*0201 binding studies were performed with PSMA288-297, GLPSIPVHPI, (SEQ ID
NO. 48) essentially as described in Examples 3 and 4 above. As seen in figure 8, this epitope
exhibits significant binding at even lower concentrations than the positive control peptides.
Example 6

Cluster Analysis (PSMA454-481).

Another peptide, SSIEGNYTLRVDCTPLMYSLVHNLTKEL, PSMA454-481, (SEQ ID NO.
55) containing an epitope cluster from prostate specific membrane antigen, was synthesized by
MPS (purity >95%) and subjected to proteasome digestion and mass spectrum analysis as
described above. Prominent peaks from the mass spectra are summarized in Table 11.



Table 11. PSMA454-481 Mass Peak Identification.

MS PEAK (measured)   PEPTIDE     SEQUENCE             CALCULATED MASS (MH+)
1238.5               454-464     SSIEGNYTLRV          1239.78
1768.38±0.60         454-469     SSIEGNYTLRVDCTPL     1768.99
1899.8               454-470     SSIEGNYTLRVDCTPLM    1900.19
1097.63±0.91         463-471     RVDCTPLMY            1098.32
2062.87±0.68         454-471*    SSIEGNYTLRVDCTPLMY   2063.36
1153                 472-481**   SLVHNLTKEL           1154.36
1449.93±1.79         470-481     MYSLVHNLTKEL         1448.73


Boldface sequences correspond to peptides predicted to bind to MHC, see Table 12.
* On the basis of mass alone this peak could equally well be assigned to the peptide 455-472;
however, proteasomal removal of just the N-terminal amino acid is considered unlikely. If the issue
were important it could be resolved by N-terminal sequencing.
** On the basis of mass this fragment might also represent 455-464.



Epitope Identification 

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by 
the SYFPEITHI or NIH algorithms were chosen for further study. The digestion and prediction 
steps of the procedure can be usefully practiced in any order. Although the substrate peptide used 
in proteasomal digest described here was specifically designed to include predicted HLA-A2.1 
binding sequences, the actual products of digestion can be checked after the fact for actual or 
predicted binding to other MHC molecules. Selected results are shown in Table 12. 
Table 12. Predicted HLA binding by proteasomally generated fragments

SEQ ID NO.   PEPTIDE         HLA        SYFPEITHI   NIH
56 & (57)    (S)IEGNYTLRV    A1         (19)        <5
                             A*0201     16 (22)     <5
58           EGNYTLRV        B*5101     15          NP†
59 & (60)    (Y)TLRVDCTPL    A*0201     20 (18)     (5)
                             A26        16 (18)     NP
                             B7         14          40
                             B8         23          <5
                             B*2705     12          30
                             Cw*0301    NP          (30)
61           LRVDCTPLM       B*2705     20          600
                             B*2709     20          NP
62 & (63)    (L)RVDCTPLMY    A1         32 (22)     125 (13.5)
                             A3         25          <5
                             A26        22          NP
                             B*2702     NP          (200)
                             B*2705     13 (NP)     (1000)

†No prediction



As seen in Table 12, N-terminal addition of authentic sequence to epitopes can often generate still
useful, even better epitopes, for the same or different MHC restriction elements. Note for example
the pairing of (L)RVDCTPLMY (SEQ ID NOS 62 and (63)) with HLA-B*2702/5, where the 10-mer
has substantial predicted half-times of dissociation and the co-C-terminal 9-mer does not. Also
note the case of SIEGNYTLRV (SEQ ID NO 57), a predicted HLA-A*0201 epitope which can be
used as a vaccine useful with HLA-B*5101 by relying on N-terminal trimming to create the
epitope.

HLA-A*0201 binding assay

HLA-A*0201 binding studies were performed, essentially as described in Example 3
above, with PSMA461-469, TLRVDCTPL, (SEQ ID NO. 60). As seen in figure 10, this epitope was
found to bind HLA-A2.1 to a similar extent as the known A2.1 binder FLPSDYFPSV (HBV18-27;
SEQ ID NO: 24) used as a positive control. Additionally, PSMA460-469 (SEQ ID NO. 59) binds
nearly as well.

ELISPOT analysis: PSMA463-471 (SEQ ID NO. 62)

The wells of a nitrocellulose-backed microtiter plate were coated with capture antibody by
incubating overnight at 4°C using 50 µl/well of 4 µg/ml murine anti-human γ-IFN monoclonal
antibody in coating buffer (35 mM sodium bicarbonate, 15 mM sodium carbonate, pH 9.5).
Unbound antibody was removed by washing 4 times 5 min. with PBS. Unbound sites on the
membrane then were blocked by adding 200 µl/well of RPMI medium with 10% serum and
incubating 1 hr. at room temperature. Antigen stimulated CD8+ T cells, in 1:3 serial dilutions,
were seeded into the wells of the microtiter plate using 100 µl/well, starting at 2x10^5 cells/well.
(Prior antigen stimulation was essentially as described in Scheibenbogen, C. et al. Int. J. Cancer
71:932-936, 1997.) PSMA463-471 (SEQ ID NO. 62) was added to a final concentration of 10 µg/ml
and IL-2 to 100 U/ml and the cells cultured at 37°C in a 5% CO2, water-saturated atmosphere for
40 hrs. Following this incubation the plates were washed 6 times with 200 µl/well of PBS
containing 0.05% Tween-20 (PBS-Tween). Detection antibody, 50 µl/well of 2 µg/ml biotinylated
murine anti-human γ-IFN monoclonal antibody in PBS+10% fetal calf serum, was added and the
plate incubated at room temperature for 2 hrs. Unbound detection antibody was removed by
washing 4 times with 200 µl of PBS-Tween. 100 µl of avidin-conjugated horseradish peroxidase
(Pharmingen, San Diego, CA) was added to each well and incubated at room temperature for 1 hr.
Unbound enzyme was removed by washing 6 times with 200 µl of PBS-Tween. Substrate was
prepared by dissolving a 20 mg tablet of 3-amino-9-ethylcarbazole in 2.5 ml of N,N-
dimethylformamide and adding that solution to 47.5 ml of 0.05 M phosphate-citrate buffer (pH
5.0). 25 µl of 30% H2O2 was added to the substrate solution immediately before distributing
substrate at 100 µl/well and incubating the plate at room temperature. After color development
(generally 15-30 min.), the reaction was stopped by washing the plate with water. The plate was
air dried and the spots counted using a stereomicroscope.

Figure 11 shows the detection of PSMA463-471 (SEQ ID NO. 62)-reactive HLA-A1+ CD8+ T
cells previously generated in cultures of HLA-A1+ CD8+ T cells with autologous dendritic cells
plus the peptide. No reactivity is detected from cultures without peptide (data not shown). In this
case it can be seen that the peptide reactive T cells are present in the culture at a frequency between
1 in 2.2x10^4 and 1 in 6.7x10^4. That this is truly an HLA-A1-restricted response is demonstrated by
the ability of anti-HLA-A1 monoclonal antibody to block γ-IFN production; see figure 12.
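
The two frequency bounds quoted above coincide with the cell numbers per well in the second and third steps of a 1:3 dilution series starting at 2x10^5 cells/well. The sketch below only reproduces that arithmetic; the function name is hypothetical, and reading the bounds as end-point dilutions is an interpretation that the text does not state.

```python
def serial_dilution(start_cells, fold, steps):
    """Cells per well for `steps` wells, each diluted `fold`-fold from the previous one."""
    return [start_cells / fold ** step for step in range(steps)]


wells = serial_dilution(start_cells=2e5, fold=3, steps=4)
for number, cells in enumerate(wells, start=1):
    print(f"well {number}: {cells:,.0f} cells")
```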
Example 7 

Cluster Analysis (PSMA653-687).

Another peptide, FDKSNPIVLRMMNDQLMFLERAFIDPLGLPDRPFY, PSMA653-687,
(SEQ ID NO. 64) containing an A2 epitope cluster from prostate specific membrane antigen,
PSMA660-681 (SEQ ID NO 65), was synthesized by MPS (purity >95%) and subjected to proteasome
digestion and mass spectrum analysis as described above. Prominent peaks from the mass spectra
are summarized in Table 13.
Table 13. PSMA653-687 Mass Peak Identification.

MS PEAK (measured)   PEPTIDE       SEQUENCE              CALCULATED MASS (MH+)
906.17±0.65          681-687**     LPDRPFY               908.05
1287.73±0.76         677-687**     DPLGLPDRPFY           1290.47
1400.3±1.79          676-687       IDPLGLPDRPFY          1403.63
[illegible]          675-687       FIDPLGLPDRPFY         1550.80
[illegible]          674-687       AFIDPLGLPDRPFY        1621.88
1775.48±1.32         673-687*      RAFIDPLGLPDRPFY       1778.07
2440.2±1.3           [illegible]   [illegible]           2442.93
1904.63±1.56         672-687*      ERAFIDPLGLPDRPFY      1907.19
2310.6±2.5           653-671       FDKSNPIVLRMMNDQLMFL   2313.82
2017.4±1.94          671-687       LERAFIDPLGLPDRPFY     2020.35
2197.43±1.78         653-670       FDKSNPIVLRMMNDQLMF    2200.66

Boldface sequences correspond to peptides predicted to bind to MHC, see Table 14.

* On the basis of mass alone this peak could equally well be assigned to a peptide beginning at 
654, however proteasomal removal of just the N-terminal amino acid is considered unlikely. If the 
issue were important it could be resolved by N-terminal sequencing. 

** On the basis of mass alone these peaks could have been assigned to internal fragments, but 
given the overall pattern of digestion it was considered unlikely. 
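
Several of the fragments in Table 13 share the C-terminus at residue 687 and differ by one N-terminal residue at a time, so the differences between successive calculated masses should each equal one residue mass. The sketch below checks this for the calculated masses listed in the table. The residue masses are rounded average values supplied here as an assumption, the function name is hypothetical, and leucine and isoleucine cannot be told apart by mass.

```python
# Rounded average residue masses in Da (assumed values); Leu and Ile are isobaric.
RESIDUE_MASS = {
    "G": 57.05, "A": 71.08, "S": 87.08, "P": 97.12, "V": 99.13,
    "T": 101.10, "C": 103.14, "L/I": 113.16, "N": 114.10, "D": 115.09,
    "Q": 128.13, "K": 128.17, "E": 129.12, "M": 131.19, "H": 137.14,
    "F": 147.18, "R": 156.19, "Y": 163.18, "W": 186.21,
}


def ladder_residues(masses, tolerance=0.05):
    """Name the residue that accounts for each step between successive masses."""
    labels = []
    for lighter, heavier in zip(masses, masses[1:]):
        delta = heavier - lighter
        label, mass = min(RESIDUE_MASS.items(), key=lambda item: abs(item[1] - delta))
        # None marks a step that matches no single residue within the tolerance.
        labels.append(label if abs(mass - delta) <= tolerance else None)
    return labels


# Calculated masses of the fragments 677-687 through 671-687 from Table 13.
ladder = [1290.47, 1403.63, 1550.80, 1621.88, 1778.07, 1907.19, 2020.35]
print(ladder_residues(ladder))
```

The steps read back the substrate residues 676 down to 671 (I, F, A, R, E, L), which is consistent with the peptide assignments in the table.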






Epitope Identification 

Fragments co-C-terminal with 8-10 amino acid long sequences predicted to bind HLA by
the SYFPEITHI or NIH algorithms were chosen for further study. The digestion and prediction
steps of the procedure can be usefully practiced in any order. Although the substrate peptide used 
in proteasomal digest described here was specifically designed to include predicted HLA-A2.1 
binding sequences, the actual products of digestion can be checked after the fact for actual or 
predicted binding to other MHC molecules. Selected results are shown in Table 14. 
Table 14. Predicted HLA binding by proteasomally generated fragments

SEQ ID NO.   PEPTIDE         HLA       SYFPEITHI   NIH
66 & (67)    (R)MMNDQLMFL    A*0201    24 (23)     1360 (722)
                             A*0205    NP†         71 (42)
                             A26       15          NP
                             B*2705    12          50
68           RMMNDQLMF       B*2705    17          75

†No prediction




As seen in Table 14, N-terminal addition of authentic sequence to epitopes can generate 
still useful, even better epitopes, for the same or different MHC restriction elements. Note for 
example the pairing of (R)MMNDQLMFL (SEQ ID NOS. 66 and (67)) with HLA-A*02, where the 
10-mer retains substantial predicted binding potential. 

HLA-A*0201 binding assay

HLA-A*0201 binding studies were performed, essentially as described in Example 3
above, with PSMA663-671, (SEQ ID NO. 66) and PSMA662-671, RMMNDQLMFL (SEQ ID NO. 67). As
seen in figures 10, 13 and 14, this epitope exhibits significant binding at even lower concentrations
than the positive control peptide (FLPSDYFPSV (HBV18-27); SEQ ID NO: 24). Though not run in
parallel, comparison to the controls suggests that PSMA662-671 (which approaches the Melan A
peptide in affinity) has the superior binding activity of these two PSMA peptides.
Example 8 

Vaccinating with epitope vaccines. 

1. Vaccination with peptide vaccines:

A. Intranodal delivery

A formulation containing peptide in aqueous buffer with an antimicrobial agent, an
antioxidant, and an immunomodulating cytokine, was injected continuously over several days into
the inguinal lymph node using a miniature pumping system developed for insulin delivery
(MiniMed; Northridge, CA). This infusion cycle was selected in order to mimic the kinetics of
antigen presentation during a natural infection.

B. Controlled release

A peptide formulation is delivered using controlled release PLGA microspheres as is known in the
art, which alter the pharmacokinetics of the peptide and improve immunogenicity. This
formulation is injected or taken orally.

C. Gene gun delivery

A peptide formulation is prepared wherein the peptide is adhered to gold microparticles as
is known in the art. The particles are delivered in a gene gun, being accelerated at high speed so as
to penetrate the skin, carrying the particles into dermal tissues that contain pAPCs.

D. Aerosol delivery

A peptide formulation is inhaled as an aerosol as is known in the art, for uptake into
appropriate vascular or lymphatic tissue in the lungs.

2. Vaccination with nucleic acid vaccines:

A nucleic acid vaccine is injected into a lymph node using a miniature pumping system,
such as the MiniMed insulin pump. A nucleic acid construct formulated in an aqueous buffered
solution containing an antimicrobial agent, an antioxidant, and an immunomodulating cytokine, is
delivered over a several day infusion cycle in order to mimic the kinetics of antigen presentation
during a natural infection.

Optionally, the nucleic acid construct is delivered using controlled release substances, such
as PLGA microspheres or other biodegradable substances. These substances are injected or taken
orally. Nucleic acid vaccines are given using oral delivery, priming the immune response through
uptake into GALT tissues. Alternatively, the nucleic acid vaccines are delivered using a gene gun,
wherein the nucleic acid vaccine is adhered to minute gold particles. Nucleic acid constructs can
also be inhaled as an aerosol, for uptake into appropriate vascular or lymphatic tissue in the lungs.

Example 9

Assays for the effectiveness of epitope vaccines.

1. Tetramer analysis:

Class I tetramer analysis is used to determine T cell frequency in an animal before and
after administration of a housekeeping epitope. Clonal expansion of T cells in response to an
epitope indicates that the epitope is presented to T cells by pAPCs. The specific T cell frequency
is measured against the housekeeping epitope before and after administration of the epitope to an
animal, to determine if the epitope is present on pAPCs. An increase in frequency of T cells
specific to the epitope after administration indicates that the epitope was presented on pAPC.

2. Proliferation assay:

Approximately 24 hours after vaccination of an animal with housekeeping epitope, pAPCs
are harvested from PBMCs, splenocytes, or lymph node cells, using monoclonal antibodies against
specific markers present on pAPCs, fixed to magnetic beads for affinity purification. Crude blood
or splenocyte preparation is enriched for pAPCs using this technique. The enriched pAPCs are
then used in a proliferation assay against a T cell clone that has been generated and is specific for
the housekeeping epitope of interest. The pAPCs are coincubated with the T cell clone and the T
cells are monitored for proliferation activity by measuring the incorporation of radiolabeled
thymidine by T cells. Proliferation indicates that T cells specific for the housekeeping epitope are
being stimulated by that epitope on the pAPCs.

3. Chromium release assay:

A human patient, or non-human animal genetically engineered to express human class I
MHC, is immunized using a housekeeping epitope. T cells from the immunized subject are used in
a standard chromium release assay using human tumor targets or targets engineered to express the
same class I MHC. T cell killing of the targets indicates that stimulation of T cells in a patient
would be effective at killing a tumor expressing a similar TuAA.
Example 10 

Induction of CTL response with naked DNA is efficient by intra-lymph node immunization.

In order to quantitatively compare the CD8+ CTL responses induced by different routes of
immunization a plasmid DNA vaccine (pEGFPL33A) containing a well-characterized
immunodominant CTL epitope from the LCMV-glycoprotein (G) (gp33; amino acids 33-41)
(Oehen, S., et al., Immunology 99, 163-169 2000) was used, as this system allows a comprehensive
assessment of antiviral CTL responses. Groups of 2 C57BL/6 mice were immunized once with
titrated doses (200-0.02 µg) of pEGFPL33A DNA or of control plasmid pEGFP-N3, administered
i.m. (intramuscular), i.d. (intradermal), i.spl. (intrasplenic), or i.ln. (intra-lymph node). Positive
control mice received 500 pfu LCMV i.v. (intravenous). Ten days after immunization spleen cells
were isolated and gp33-specific CTL activity was determined after secondary in vitro
restimulation. As shown in Fig. 15, i.m. or i.d. immunization induced weakly detectable CTL
responses when high doses of pEFGPL33A DNA (200 µg) were administered. In contrast, potent
gp33-specific CTL responses were elicited by immunization with only 2 µg pEFGPL33A DNA
i.spl. and with as little as 0.2 µg pEFGPL33A DNA given i.ln. (figure 15; symbols represent
individual mice and one of three similar experiments is shown). Immunization with the control
pEGFP-N3 DNA did not elicit any detectable gp33-specific CTL responses (data not shown).
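
The dose comparison between routes can be expressed as orders of magnitude. In the sketch below the function names are hypothetical, and the ten-fold spacing of the titration is an assumption: the text gives only the 200 to 0.02 µg range.

```python
import math


def titration(top_dose_ug, fold, steps):
    """Dose series starting at `top_dose_ug`, each step `fold`-fold lower (assumed spacing)."""
    return [top_dose_ug / fold ** step for step in range(steps)]


def orders_of_magnitude(higher_dose, lower_dose):
    """Base-10 logarithm of the ratio between two doses."""
    return math.log10(higher_dose / lower_dose)


print(titration(200, 10, 5))
# 200 ug i.m. or i.d. gave weak responses; 0.2 ug i.ln. gave potent responses.
print(round(orders_of_magnitude(200, 0.2), 2))
```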
Example 11 

Intra-lymph node DNA immunization elicits anti-tumor immunity. 

To examine whether the potent CTL responses elicited following i.ln. immunization were
able to confer protection against peripheral tumors, groups of 6 C57BL/6 mice were immunized
three times at 6-day intervals with 10 µg of pEFGPL33A DNA or control pEGFP-N3 DNA. Five
days after the last immunization small pieces of solid tumors expressing the gp33 epitope (EL4-33)
were transplanted s.c. into both flanks and tumor growth was measured every 3-4 d. Although the
EL4-33 tumors grew well in mice that had been repetitively immunized with control pEGFP-N3
DNA (figure 16), mice which were immunized with pEFGPL33A DNA i.ln. rapidly eradicated the
peripheral EL4-33 tumors (figure 16).
Example 12

Differences in lymph node DNA content mirror differences in CTL response following
intra-lymph node and intramuscular injection.

pEFGPL33A DNA was injected i.ln. or i.m. and plasmid content of the injected or draining
lymph node was assessed by real time PCR after 6, 12, 24, and 48 hours, and 4 and 30 days. At 6, 12,
and 24 hours the plasmid DNA content of the injected lymph nodes was approximately three orders
of magnitude greater than that of the draining lymph nodes following i.m. injection. No plasmid
DNA was detectable in the draining lymph node at subsequent time points (Fig. 17). This is
consonant with the three orders of magnitude greater dose needed using i.m. as compared to i.ln.
injections to achieve similar levels of CTL activity. CD8-/- knockout mice, which do not develop
a CTL response to this epitope, were also injected i.ln., showing that clearance of DNA from the lymph
node is not due to CD8+ CTL killing of cells in the lymph node. This observation also supports the
conclusion that i.ln. administration will not provoke immunopathological damage to the lymph
node.



Example 13 

Administratioii of a DNA plasmid formulation of a therapeutic vaccine for melanoma to 
humans. 

SYNCHROTOPE TA2M, a melanoma vaccine encoding the HLA-A2-restricted tyrosinase
epitope SEQ ID NO. 1 and epitope cluster SEQ ID NO. 69, was formulated in 1% benzyl alcohol,
1% ethyl alcohol, 0.5 mM EDTA, citrate-phosphate, pH 7.6. Aliquots of 80, 160, and 320 µg
DNA/ml were prepared for loading into MINIMED 407C infusion pumps. The catheter of a
SILHOUETTE infusion set was placed into an inguinal lymph node visualized by ultrasound
imaging. The assembly of pump and infusion set was originally designed for the delivery of insulin
to diabetics, and the usual 17 mm catheter was substituted with a 31 mm catheter for this application.
The infusion set was kept patent for 4 days (approximately 96 hours) with an infusion rate of about
25 µl/hour, resulting in a total infused volume of approximately 2.4 ml. Thus the total administered
dose per infusion was approximately 200 and 400 µg, and can be 800 µg, respectively, for the
three concentrations described above. Following an infusion, subjects were given a 10 day rest
period before starting a subsequent infusion. Given the continued residency of plasmid DNA in the
lymph node after administration (as in example 12) and the usual kinetics of CTL response
following disappearance of antigen, this schedule will be sufficient to maintain the immunologic
CTL response.
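
By way of illustration only, the dose arithmetic of this example can be sketched as follows. The function name and its parameters are hypothetical and are not part of the disclosed protocol; the sketch simply assumes a constant infusion rate of 25 µl/hour sustained for 96 hours.

```python
def infusion_totals(conc_ug_per_ml, rate_ul_per_hr=25.0, duration_hr=96.0):
    """Return (infused volume in ml, total DNA dose in ug) for one infusion.

    Assumes a constant infusion rate over the whole duration (an
    illustrative simplification, not a statement about pump behavior).
    """
    volume_ml = rate_ul_per_hr * duration_hr / 1000.0  # 1000 ul per ml
    dose_ug = conc_ug_per_ml * volume_ml
    return volume_ml, dose_ug


if __name__ == "__main__":
    # The three aliquot concentrations named in the example, in ug DNA/ml.
    for conc in (80, 160, 320):
        volume_ml, dose_ug = infusion_totals(conc)
        print(f"{conc} ug/ml: {volume_ml:.1f} ml infused, {dose_ug:.0f} ug DNA")
```

Under these assumptions the computed doses are 192, 384, and 768 µg, which correspond to the approximate figures of 200, 400, and 800 µg given above.
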
Example 14
Additional Epitopes.

The methodologies described above, and in particular in examples 3-7, have been applied
to additional synthetic peptide substrates, leading to the identification of further epitopes as set
forth in tables 15-36 below. The substrates used here were designed to identify products of
housekeeping proteasomal processing that give rise to HLA-A*0201 binding epitopes, but
additional MHC-binding reactivities can be predicted, as discussed above. Many such reactivities
are disclosed; however, these listings are meant to be exemplary, not exhaustive or limiting. As
also discussed above, individual components of the analyses can be used in varying combinations
and orders. The digests of the NY-ESO-1 substrates 136-163 and 150-177 (SEQ ID NOS. 254 and
255, respectively) yielded fragments that did not fly well in MALDI-TOF mass spectrometry.
However, they were quite amenable to N-terminal peptide pool sequencing, thereby allowing
identification of cleavage sites. Not all of the substrates necessarily meet the formal definition of
an epitope cluster as referenced in example 3. Some clusters are so large, e.g. NY-ESO-Wm, that
it was more convenient to use substrates spanning only a portion of this cluster. In other cases,
substrates were extended beyond clusters meeting the formal definition to include neighboring
predicted epitopes. In some instances, actual binding activity may have dictated what substrate was
made, as with, for example, the MAGE epitopes reported here, where HLA binding activity was
determined for a selection of peptides with predicted affinity before synthetic substrates were
designed.
Example 15
Evaluating Likelihood of Epitope Cross-reactivity on Non-target Tissues.

As noted above, PSA is a member of the kallikrein family of proteases, which is itself a
subset of the serine protease family. While the members of this family sharing the greatest degree
of sequence identity with PSA also share similar expression profiles, it remains possible that
individual epitope sequences might be shared with proteins having distinctly different expression
profiles. A first step in evaluating the likelihood of undesirable cross-reactivity is the identification
of shared sequences. One way to accomplish this is to conduct a BLAST search of an epitope
sequence against the SWISSPROT or Entrez non-redundant peptide sequence databases using the
"Search for short nearly exact matches" option; hypertext transfer protocol accessible on the world
wide web (http://www) at "ncbi.nlm.nih.gov/blast/index.html". Thus searching SEQ ID NO. 214,
WVLTAAHCL, against SWISSPROT (limited to entries for Homo sapiens) one finds four exact
matches, including PSA. The other three are from kallikrein 1 (tissue kallikrein), and elastase 2A
and 2B. While these nine amino acid segments are identical, the flanking sequences are quite
distinct, particularly on the C-terminal side, suggesting that processing may proceed differently and
that thus the same epitope may not be liberated from these other proteins. (Please note that
kallikrein naming is confused. Thus the kallikrein 1 [accession number P06870] is a different
protein than the one [accession number AAD13817] mentioned in the paragraph on PSA above in
the section on tumor-associated antigens).
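
As a minimal sketch of the comparison just described, exact occurrences of an epitope can be located in a set of protein sequences and the residues flanking each occurrence reported side by side. This is not a substitute for a BLAST search. The function name is hypothetical, and the three sequences in the usage portion are invented placeholders, not the actual PSA, kallikrein, or elastase sequences; only the epitope WVLTAAHCL is taken from the text.

```python
def find_epitope_contexts(epitope, proteins, flank=5):
    """List exact matches of `epitope` with their N- and C-terminal flanks.

    `proteins` maps a name to a one-letter amino acid sequence.
    Returns tuples of (name, 1-based start position, N-flank, C-flank).
    """
    hits = []
    for name, seq in proteins.items():
        pos = seq.find(epitope)
        while pos != -1:
            n_flank = seq[max(0, pos - flank):pos]
            c_start = pos + len(epitope)
            c_flank = seq[c_start:c_start + flank]
            hits.append((name, pos + 1, n_flank, c_flank))
            pos = seq.find(epitope, pos + 1)  # continue past this match
    return hits


if __name__ == "__main__":
    # Invented placeholder sequences, for illustration only.
    toy_proteins = {
        "toy_protein_A": "MKTAIWVLTAAHCLGGSDE",
        "toy_protein_B": "QQRWVLTAAHCLPPKLMN",
        "toy_protein_C": "MSTNPKPQRK",
    }
    for name, start, n_flank, c_flank in find_epitope_contexts(
        "WVLTAAHCL", toy_proteins
    ):
        print(f"{name}: start {start}, N-flank {n_flank}, C-flank {c_flank}")
```

Identical 9-mers with differing flanks, as in the two matching placeholder sequences, are the cases for which the digestion and cytotoxicity tests described in this example would be informative.
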

It is possible to test this possibility in several ways. Synthetic peptides containing the 
epitope sequence embedded in the context of each of these proteins can be subjected to in vitro 
proteasomal digestion and analysis as described above. Alternatively, cells expressing these other 
proteins, whether by natural or recombinant expression, can be used as targets in a cytotoxicity (or 
similar) assay using CD8 + T cells that recognize the epitope, in order to determine if the epitope is 
processed and presented. 
Example 16 
Epitope Clusters. 

Known and predicted epitopes are generally not evenly distributed across the sequences of 
protein antigens. As referred to above, we have defined segments of sequence containing a higher 
than average density of (known or predicted) epitopes as epitope clusters. Among the uses of 
epitope clusters is the incorporation of their sequence into substrate peptides used in proteasomal 
digestion analysis as described herein. Epitope clusters can also be useful as vaccine components. 
A fuller discussion of the definition and uses of epitope clusters is found in U.S. Patent Application 
No. 09/561,571 entitled EPITOPE CLUSTERS, previously incorporated by reference. 
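
One simple way to locate candidate regions of elevated epitope density is to merge predicted 9-mers that overlap one another into contiguous stretches. The sketch below assumes that two 9-mers belong to the same region when they share at least one residue; that rule and the function name are illustrative assumptions and do not restate the formal definition given in the referenced application. In particular, the sketch does not attempt to group nearby but non-overlapping epitopes.

```python
def group_overlapping(starts, length=9):
    """Merge overlapping fixed-length peptides into contiguous regions.

    `starts` holds 1-based start positions. Returns a list of
    (first_aa, last_aa, n_epitopes) for regions with two or more peptides.
    """
    regions = []
    for start in sorted(starts):
        end = start + length - 1
        if regions and start <= regions[-1][1]:
            # Shares at least one residue with the current region: extend it.
            regions[-1][1] = max(regions[-1][1], end)
            regions[-1][2] += 1
        else:
            regions.append([start, end, 1])
    return [tuple(r) for r in regions if r[2] >= 2]


if __name__ == "__main__":
    # A few start positions of the kind listed for a single protein.
    example_starts = [601, 602, 603, 604, 606, 619, 622, 637, 639, 47]
    for first_aa, last_aa, count in group_overlapping(example_starts):
        print(f"{first_aa}-{last_aa}: {count} predicted epitopes")
```

A peptide with no overlapping neighbor, such as the one starting at position 47 in the usage portion, does not form a region under this assumption.
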

The following tables (37-60) present 9-mer epitopes predicted for HLA-A2 binding using
both the SYFPEITHI and NIH algorithms and the epitope density of regions of overlapping
epitopes, and of epitopes in the whole protein, and the ratio of these two densities. (The ratio must
exceed one for there to be a cluster by the above definition; requiring higher values of this ratio
reflects preferred embodiments). Individual 9-mers are ranked by score and identified by the
position of their first amino acid in the complete protein sequence. Each potential cluster from a
protein is numbered. The range of amino acid positions within the complete sequence that the
cluster covers is indicated, as are the rankings of the individual predicted epitopes it is made up of.
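
The density ratio described above can be illustrated with a short calculation. The sketch assumes that the cluster density is the number of predicted epitopes in the cluster divided by the number of residues the cluster spans, and that the whole-protein density is the number of predicted epitopes above threshold divided by the protein length; the function name is hypothetical, and the choice of denominator for the whole protein is an assumption.

```python
def cluster_density_ratio(n_cluster_epitopes, first_aa, last_aa,
                          n_total_epitopes, protein_length):
    """Return (cluster density, whole-protein density, ratio).

    Densities are expressed in epitopes per amino acid. A ratio above
    one means the region is denser in epitopes than the protein overall.
    """
    cluster_length = last_aa - first_aa + 1
    cluster_density = n_cluster_epitopes / cluster_length
    whole_density = n_total_epitopes / protein_length
    return cluster_density, whole_density, cluster_density / whole_density


if __name__ == "__main__":
    # 11 predicted epitopes spanning residues 2-26 of a 661 residue
    # protein that has 109 predicted epitopes in total.
    cluster, whole, ratio = cluster_density_ratio(11, 2, 26, 109, 661)
    print(f"cluster {cluster:.3f}, whole protein {whole:.3f}, ratio {ratio:.3f}")
```

With the inputs shown, the three values round to 0.440, 0.165, and 2.668, so this region would qualify as a cluster under the requirement that the ratio exceed one.
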

Table 37

BIMAS-NIH/Parker algorithm Results for gp100

Rank  Start  Score    Rank  Start  Score
1     619    1493     21    416    19
2     602    413      22    25     18
3     162    226      23    566    17
4     18     118      24    603    15
5     178    118      25    384    14
6     273    117      26    13     14
7     601    81       27    290    12
8     243    63       28    637    10
9     606    60       29    639    9
10    373    50       30    485    9
11    544    36       31    453    8
12    291    29       32    102    8
13    592    29       33    399    8
14    268    29       34    456    7
15    47     27       35    113    7
16    585    26       36    622    7
17    576    21       37    69     7
18    465    21       38    604    6
19    570    20       39    350    6
20    9      19       40    583    5
Table 38

SYFPEITHI (Rammensee algorithm) Results for gp100
Table 39

Prediction of clusters for gp100

Total AAs: 661
Total 9-mers: 653
SYFPEITHI >= 16: 109 9-mers
NIH >= 5: 40 9-mers

                                                                   Epitopes/AA
Cluster #  AAs      Epitopes (by rank)                             Cluster  Whole Pr.  Ratio

SYFPEITHI
1          2-26     39, 12, 109, 34, 55, 11, 9, 108, 107, 74, 4    0.440    0.165      2.668
2          69-115   72, 71, 106, 53, 85, 105, 70, 84, 69, 104      0.213    0.165      1.290
3          95-115   85, 105, 70, 84, 69                            0.238    0.165      1.444
4          162-188  2, 52, 17, 103, 16, 51                         0.222    0.165      1.348
5          205-225  50, 68, 102, 101                               0.190    0.165      1.155
6          243-258  28, 49                                         0.125    0.165      0.758
7          267-306  48, 21, 38, 27, 20, 99, 83, 37, 67             0.225    0.165      1.364
8          322-332  47, 82                                         0.182    0.165      1.103
9          343-358  66, 33                                         0.125    0.165      0.758
10         371-381  65, 26                                         0.182    0.165      1.103
11         397-421  36, 25, 64, 98, 81, 97, 63, 96                 0.320    0.165      1.941
12         442-476  95, 46, 11-, 62, 15, 3, 35, 24, 94             0.257    0.165      1.559
13         482-502  93, 31, 45, 93                                 0.190    0.165      1.155
14         539-552  91, 23                                         0.143    0.165      0.866
15         556-627  79, 78, 90, 30, 29, 61, 44, 60, 77, 14,        0.431    0.165      2.611
                    89, 43, 88, 10, 87, 42, 22, 41, 59, 8,
                    6, 76, 7, 5, 13, 58, 57, 1, 40, 75, 19

NIH
1          9-33     20, 26, 4, 22                                  0.160    0.061      2.644
2          268-281  14, 6                                          0.143    0.061      2.361
3          290-299  27, 12                                         0.200    0.061      3.305
4*         102-121  32, 35                                         0.100    0.061      1.653
5*         373-392  10, 25                                         0.100    0.061      1.653
6          453-473  31, 34, 18                                     0.143    0.061      2.361
7          566-600  23, 19, 17, 40, 16, 13                         0.171    0.061      2.833
8          601-614  7, 2, 24, 38, 9                                0.357    0.061      5.902
9          619-630  1, 36                                          0.17     0.061      2.754
10         637-647  28, 29                                         0.18     0.061      3.005

*Nearby but not overlapping epitopes
Table 40

BIMAS-NIH/Parker algorithm Results for PSMA

Rank  Start  Score
1     663    1360
2     711    1055
3     4      485
4     27     400
5     26     375
6     668    261
7     707    251
8     469    193
9     731    177
10    35     67
11    33     64
12    554    59
13    427    50
14    115    47
15    20     40
16    217    26
17    583    24
18    415    19
19    193    14
20    240    12
21    627    11
22    260    10
23    130    10
24    741    9
25    3      9
26    733    8
27    726    7
28    286    6
29    174    5
30    700    5






Table 41

SYFPEITHI (Rammensee algorithm) Results for PSMA

Rank  Start  Score    Rank  Start  Score    Rank  Start  Score
1     469    27       31    26     20       61    305    17
2     27     27       32    3      20       62    304    17
3     741    26       33    583    19       63    286    17
4     711    26       34    579    19       64    282    17
5     354    25       35    554    19       65    169    17
6     4      25       36    550    19       66    142    17
7     663    24       37    547    19       67    122    17
8     130    24       38    390    19       68    738    16
9     57     24       39    219    19       69    634    16
10    707    23       40    193    19       70    631    16
11    260    23       41    700    18       71    515    16
12    20     23       42    472    18       72    456    16
13    603    22       43    364    18       73    440    16
14    218    22       44    317    18       74    385    16
15    109    22       45    253    18       75    373    16
16    731    21       46    91     18       76    365    16
17    668    21       47    61     18       77    361    16
18    660    21       48    13     18       78    289    16
19    507    21       49    733    17       79    278    16
20    454    21       50    673    17       80    258    16
21    427    21       51    671    17       81    247    16
22    358    21       52    642    17       82    217    16
23    284    21       53    571    17       83    107    16
24    115    21       54    492    17       84    100    16
25    33     21       55    442    17       85    75     16
26    606    20       56    441    17       86    37     16
27    568    20       57    397    17       87    30     16
28    473    20       58    391    17       88    21     16
29    461    20       59    357    17
30    200    20       60    344    17
Table 42

Prediction of clusters for prostate-specific membrane antigen (PSMA)

Total AAs: 750
Total 9-mers: 742
SYFPEITHI >= 16: 88 9-mers
NIH >= 5: 30 9-mers

                                                     Epitopes/AA
Cluster #  AAs      Epitopes (by rank)               Cluster  Whole Pr.  Ratio

SYFPEITHI
1          3-12     32, 6                            0.200    0.117      1.705
2          13-45    13, 12, 88, 31, 2, 87, 25, 86    0.242    0.117      2.066
3          57-69    9, 47                            0.154    0.117      1.311
4          100-138  84, 83, 15, 24, 67, 8            0.154    0.117      1.311
5          193-208  40, 30                           0.111    0.117      0.947
6          217-227  82, 14, 39                       0.273    0.117      2.324
7          247-268  81, 45, 80, 11                   0.182    0.117      1.550
8          278-297  79, 64, 23, 63, 78               0.250    0.117      2.131
9          354-381  5, 59, 22, 77, 43, 76, 75        0.250    0.117      2.131
10         385-405  74, 38, 58, 57                   0.190    0.117      1.623
11         440-450  73, 56, 55                       0.273    0.117      2.324
12         454-481  20, 72, 29, 1, 42, 28            0.214    0.117      1.826
13         507-523  17, 71                           0.118    0.117      1.003
14         547-562  37, 36, 35                       0.188    0.117      1.598
15         568-591  27, 53, 34, 33                   0.167    0.117      1.420
16         603-614  13, 26                           0.167    0.117      1.420
17         631-650  70, 69, 52                       0.150    0.117      1.278
18         660-681  18, 7, 17, 51, 50                0.227    0.117      1.937
19         700-719  41, 10, 4                        0.150    0.117      1.278

NIH
2          20-43    15, 5, 4, 11, 10                 0.208    0.040      5.208
3*         415-435  18, 13                           0.095    0.040      2.381
4          663-676  1, 6                             0.143    0.040      3.571
5          700-715  30, 7, 3                         0.188    0.040      4.688
6          726-749  27, 9, 26, 24                    0.167    0.040      4.167

*Nearby but not overlapping epitopes
Table 43

BIMAS-NIH/Parker algorithm Results for PSA

Rank  Start  Score
1     7      607
2     170    243
3     52     124
4     53     112
5     195    101
6     165    23
7     72     18
8     245    18
9     2      16
10    59     16
11    122    15
12    125    15
13    191    13
14    9      8
15    14     6
16    175    5
17    130    5
Table 44

SYFPEITHI (Rammensee algorithm) Results for PSA

Rank  Start         Score
1     72            [illegible]
2     170           22
3     53            22
4     7             22
5     234           21
6     [illegible]   21
7     140           21
8     66            21
9     241           20
10    175           20
11    12            20
12    41            19
13    20            19
14    14            19
15    130           18
16    124           18
17    121           18
18    47            18
19    17            18
20    218           17
21    133           17
22    125           17
23    122           17
24    118           17
25    110           17
26    67            17
27    52            17
28    21            17
29    10            17
30    2             17
31    184           16
32    179           16
33    158           16
34    79            16
35    73            16
36    4             16






Table 45

Prediction of clusters for prostate specific antigen (PSA)

Total AAs: 261
Total 9-mers: 253
SYFPEITHI >= 16: 36 9-mers
NIH >= 5: 17 9-mers

                                                     Epitopes/AA
Cluster #  AAs      Epitopes (by rank)               Cluster  Whole Pr.  Ratio

SYFPEITHI
1          2-29     30, 36, 4, 11, 14, 29, 19, 13, 28    0.321    0.138      2.330
2          41-61    12, 18, 27, 3                        0.190    0.138      1.381
3          66-87    8, 26, 1, 35, 34                     0.227    0.138      1.648
4          110-148  25, 24, 17, 23, 16, 22, 15, 21, 7    0.184    0.138      1.332
5          158-192  33, 6, 2, 10, 32, 31                 0.171    0.138      1.243
6          234-249  5, 9                                 0.125    0.138      0.906
7*         118-133  24, 17, 23, 16, 22                   0.313    0.138      2.266
8*         118-138  24, 17, 23, 16, 22, 15               0.286    0.138      2.071

NIH
1          2-22     9, 1, 14, 15                         0.190    0.065      2.924
2          52-67    3, 4, 10                             0.188    0.065      2.879
3          122-138  11, 12, 17                           0.176    0.065      2.709
4          165-183  6, 2, 16                             0.158    0.065      2.424
5          191-203  13, 5                                0.154    0.065      2.362
6**        52-80    3, 4, 10, 7                          0.138    0.065      2.118

*These clusters are internal to the less preferred cluster #4.

**Includes a nearby but not overlapping epitope.
Table 46

BIMAS-NIH/Parker algorithm Results for PSCA

Rank  Start  Score
1     43     [illegible]
2     5      84
3     7      79
4     109    36
5     105    [illegible]
6     108    24
7     14     21
8     20     18
9     115    [illegible]
10    42     [illegible]
11    36     15
12    99     9
13    58     8



Table 47 

SYFPEITHI (Rammensee algorithm) Results for 
Table 48

Prediction of clusters for prostate stem cell antigen (PSCA)

Total AAs: 123
Total 9-mers: 115
SYFPEITHI >= 16: 33
SYFPEITHI >= 20: 13
NIH >= 5: 13

                                                               Epitopes/AA
Cluster #  AAs      Epitopes (by rank)                         Cluster  Whole Pr.  Ratio

SYFPEITHI >= 16
1          1-28     20, 31, 19, 4, 7, 12, 33, 18, 13, 2, 11    0.393    0.268      1.464
2          43-71    25, 30, 24, 9, 17, 16, 15, 29              0.276    0.268      1.028
3          92-123   32, 23, 6, 27, 14, 22, 3, 26, 10,          0.406    0.268      1.514
                    1, 8, 21, 5

SYFPEITHI >= 20
1          5-28     4, 7, 12, 13, 2, 11                        0.250    0.106      2.365
2          99-123   6, 3, 10, 1, 8, 5                          0.240    0.106      2.271

NIH
1          5-28     2, 3, 7, 8                                 0.167    0.106      1.577
2          36-51    11, 10, 1                                  0.188    0.106      1.774
3          99-123   12, 5, 6, 4, 9                             0.200    0.106      1.892
4*         105-116  5, 6, 4                                    0.250    0.106      2.365

*This cluster is internal to the less preferred cluster #3.
In tables 49-60 epitope prediction and cluster analysis data for each algorithm are
presented together in a single table.
Table 49

Prediction of clusters for MAGE-1 (NIH algorithm)

Total AAs: 309
Total 9-mers: 301
NIH >= 5: 19 9-mers

                    Epitope  Start     NIH      Epitopes/AA
Cluster #  AAs      Rank     Position  Score    Cluster  Whole Pr.  Ratio
1          18-32    16       18        9        0.133    0.063      2.112
                    19       24        7
2          101-113  14       101       11       0.154    0.063      2.442
                    7        105       44
3          146-159  9        146       32       0.143    0.063      2.263
                    3        151       169
4          169-202  10       169       32       0.176    0.063      2.796
                    13       174       16
                    18       181       8
                    17       187       8
                    6        188       74
                    5        194       110
5          264-277  2        264       190      0.143    0.063      2.263
                    12       269       20
6          278-290  1        278       743      0.154    0.063      2.437
                    11       282       28
Table 50

Prediction of clusters for MAGE-1 (SYFPEITHI algorithm) 



Total AAs: 309 
Total 9-mers: 301 
SYFPEITHI 16: 46 9-mers 



Cluster # 


Aas 


Epitope 


otan 


SYFPEITHI 


Enitan.es/AA 

t«|JIIV|Jww// , \n 




Dncttiftn 


Score 

W WW 1 W 


Cluster Whole Ratio 


1 


7-49 


zz 


7 

f 


19 

1 w 


0 233 0153 1.522 ! 




y 


10 


22 








07 


4A 

lo 


1 o 








AG. 
16 


Oft 
ZU 


ZU 








OO 

zo 


00 

zz 


1R 








on 


OA 


1ft 








oo 
00 


01 


17 








Oft 

30 


00 


1R 
IO 








o 


00 


9fi 








17 


H 1 


20 




A*\ 

2 


89-132 


4ft 
10 


AO 


ZZ 


0 973 0 153 1 783 

U.Cl J V. Iww I.I WW 






18 


00 

yz 


ZV 




• 




"7 

7 


00 
90 


oo 
zo 








oo 

23 


OA 

yo 


1Q 

1*7 








43 


QQ 

yo 


1ft 
JO 








4 




ZO 








8 


HftK 


ZO 








34 


4ft7 


17 








oc 
00 


1AA 


17 
1 1 








00 


11^ 
I iO 


17 








07 
0/ 


I 10 


17 








ny 


194 

IZH 


90 




3 


167-203 


44 


1G7 
10/ 


1fi 

IO 


0 970 0153 1 766 

U.L'V Wtlww li/WU 






20 


1CQ 

loy 


on 
zu 








12 


i/4 


01 

Z I 








Oil 

24 


loi 










0 


1P7 

lo/ 


OA 
Z*f 








31 


loo 


1ft 

IO 








25 


191 


19 








38 


192 


17 








1 


194 


27 








13 


195 


21 




4 


230-246 


14 


230 


21 


0.118 0.153 0.769 






39 


238 


17 




5 


264-297 


15 


264 


21 


0.235 0.153 1.538 






32 


269 


18 








40 


270 


17 








26 


271 


19 








46 


275 


16 








3 


278 


26 








21 


282 


20 








41 


289 


17 
Table 51

Prediction of clusters for MAGE-2 (NIH algorithm)

Total AAs: 314
Total 9-mers: 308
NIH >= 5: 20 9-mers

                    Epitope       Start         NIH           Epitopes/AA
Cluster #  AAs      Rank          Position      Score         Cluster  Whole Pr.  Ratio
1          101-120  18            101           5.373         0.150    0.065      [illegible]
                    16            108           6.756
                    [illegible]   [illegible]   [illegible]
2          153-167  8             153           31.883        0.200    0.065      3.080
                    4             158           168.552
                    7             159           32.138
3          169-211  14            169           8.535         0.209    0.065      3.223
                    19            174           5.346
                    6             176           49.993
                    11            181           15.701
                    15            188           7.536
                    12            195           12.809
                    5             200           88.783
                    10            201           16.725
                    17            203           5.609
4          271-284  3             271           398.324       0.143    0.065      2.200
                    9             276           19.658
Table 52

Prediction of clusters for MAGE-2 (SYFPEITHI algorithm)

Total AAs: 314
Total 9-mers: 308
SYFPEITHI >= 16: 52 9-mers

                    Epitope       Start         SYFPEITHI  Epitopes/AA
Cluster #  AAs      Rank          Position      Score      Cluster  Whole Pr.  Ratio
1          15-32    13            15            21         0.278    0.169      1.645
                    [illegible]   18            18
                    43            20            16
                    30            22            18
                    21            24            19
2          37-56    31            37            18         0.250    0.169      1.481
                    16            40            20
                    44            44            16
                    14            45            21
                    22            48            19
3          96-133   36            96            17         0.211    0.169      1.247
                    46            101           16
                    6             108           25
                    47            109           16
                    2             112           27
                    37            120           17
                    38            125           17
                    17            131           20
4          153-216  12            153           22         0.344    0.169      2.036
                    39            158           17
                    7             159           25
                    23            161           19
                    24            162           19
                    [illegible]   [illegible]   16
                    49            167           16
                    32            170           18
                    [illegible]   171           16
                    4             174           26
                    9             176           24
                    51            177           16
                    15            181           21
                    25            188           19
                    18            194           20
                    33            195           18
                    19            198           20
                    3             200           27
                    1             201           28
                    40            202           17
                    10            203           23
                    52            208           16
5          237-254  26            237           19         0.167    0.169      0.987
                    27            245           19
                    34            246           18
6          271-299  8             271           25         0.241    0.169      1.430
                    35            276           18
                    41            277           17
                    11            278           23
                    28            283           19
                    20            285           20
                    42            291           17
Table 53

Prediction of clusters for MAGE-3 (NIH algorithm)

Total AAs: 314
Total 9-mers: 308
NIH >= 5: 22 9-mers

         Epitope       Start     NIH        Epitopes/AA
AAs      Rank          Position  Score      Cluster  Whole Pr.  Ratio
101-120  15            101       11.002     0.200    0.071      2.800
         21            105       6.488
         8             108       49.134
         2             112       339.313
153-167  18            153       7.776      0.200    0.071      2.800
         [illegible]   158       51.77
         22            159       5.599
174-209  17            174       8.832      0.194    0.071      2.722
         7             176       49.993
         13            181       15.701
         19            188       7.536
         14            195       12.809
         5             200       88.783
         12            201       16.725
237-251  16            237       10.868     0.200    0.071      2.800
         4             238       148.896
         20            243       6.88
271-284  1             271       2655.495   0.143    0.071      2.000
         11            276       19.658
Table 54

Prediction of clusters for MAGE-3 (SYFPEITHI algorithm)

Total AAs: 314
Total 9-mers: 308
SYFPEITHI >= 16: 47 9-mers

                    Epitope  Start         SYFPEITHI  Epitopes/AA
Cluster #  AAs      Rank     Position      Score      Cluster  Whole Pr.  Ratio
1          15-32    12       15            21         0.278    0.153      1.820
                    26       18            18
                    37       20            16
                    27       22            18
                    18       24            19
2          38-56    38       38            16         0.263    0.153      1.725
                    15       40            20
                    39       44            16
                    13       45            21
                    19       48            19
3          101-142  28       101           18         0.190    0.153      1.248
                    40       105           16
                    1        [illegible]   31
                    6        [illegible]   25
                    31       [illegible]   17
                    32       125           17
                    16       131           20
                    41       134           16
4          153-216  20       153           19         0.313    0.153      2.048
                    29       156           18
                    33       158           17
                    21       159           19
                    34       161           17
                    42       164           16
                    43       167           16
                    10       174           22
                    8        176           23
                    14       181           21
                    22       188           19
                    44       193           16
                    11       194           22
                    23       195           19
                    45       197           16
                    17       198           20
                    3        200           27
                    2        201           28
                    35       202           17
                    46       208           16
5          220-230  5        220           26         0.182    0.153      1.191
                    47       222           16
6          237-246  7        237           25         0.200    0.153      1.311
                    9        238           23
7          271-293  4        271           27         0.217    0.153      1.425
                    30       276           18
                    24       278           19
                    36       283           17
                    25       285           19
Table 55

Prediction of clusters for PRAME (NIH algorithm)

Total AAs: 509
Total 9-mers: 501
NIH >= 5: 40 9-mers

                    Epitope       Start         NIH           Epitopes/AA
Cluster #  AAs      Rank          Position      Score         Cluster       Whole Pr.  Ratio
1          33-47    20            33            18            [illegible]   0.080      1.670
                    17            39            21
2          71-81    9             71            50            0.2           0.07984    [illegible]
                    32            73            7
3          99-108   23            100           15            0.2           0.07984    [illegible]
                    24            99            13
4          126-135  38            126           5             0.2           0.07984    2.505
                    35            127           6
5          224-246  5             224           124           0.130         0.080      [illegible]
                    8             230           63
                    39            238           5
6          290-303  18            290           18            0.214         0.080      2.684
                    14            292           23
                    7             295           66
7          305-324  28            305           10            0.200         0.080      2.505
                    30            308           8
                    [illegible]   [illegible]   [illegible]
                    36            316           6
8          394-409  2             394           182           0.188         0.080      2.348
                    12            397           42
                    31            401           7
9          422-443  10            422           49            0.227         0.080      2.847
                    3             425           182
                    34            431           7
                    29            432           9
                    4             435           160
10         459-487  15            459           21            0.172         0.080      2.159
                    11            462           45
                    22            466           15
                    40            472           5
                    37            479           6
Table 56

Prediction of clusters for PRAME (SYFPEITHI algorithm) 



Total AAs: 509 
Total 9-mers: 501 
SYFPEITHI 17: 80 9-mers 



U lUSter 7F 


AAc 
MM5 




Start 


SYFPEITHI 




Epitopes/AA 


Ratio 




Rank 


Position 


Score 


Cluster 


Whole Pr. 


1 


18-59 


00 


ID 


17 


0 93ft 
u.zoo 


0 160 

U. 1 Uw 


1 491 

1 1 






0U 


91 


1ft 
10 












00 


OR 

Z0 


17 












00 


00 


90 

zu 












Zz 


9.4 
04 


99 
zz 












El 
Ol 


V7 
Of 


1ft 

10 












c 
0 


00 

oy 


07 
Zf 












0°. 

zo 




92 












10 


44 
44 


94 
ZH 












4fi 

*rO 


51 


19 








2 


78-1 10 


Oft 
00 


7ft 


90 
ZU 


0 963 


0160 

U. 1 vu 


1 648 ; 






67 

Or 


80 


17 












59 

oz 


84 
o*t 


18 












Z*T 


86 

OU 


99 












CO 

00 


Q1 


1ft 












OA 


93 


22 












Q 




95 












ft 
0 


1 vV 


96 












C/1 

Oh 


103 


18 

1 u 












00 


107 


18 

IO 








o 
0 


4Q4 OA*) 

iyi-zUz 


E.R 
00 


1Q1 


18 

10 


0 167 


0 160 


1.044 






OO 




90 
ZU 








4 


OftC 01 G 

ZU0-Z 10 


9R 
ZD 


905 
zuo 




0 182 

U.I U£> 


0160 


1.139 






07 
CI 


907 
ZUf 


92 








5 


OOO OOQ 


A7 
HI 


000 
ZZZ 


1Q 


fi 935 


0 160 


1,474 






A A 
14 


00/ 
ZZ4 


94 
Z*f 












69 


007 
ZZ7 


17 












57 


OOA 

ZOO 


1ft 

10 








6 


241-273 


70 


OvM 

241 


17 

1 1 


0 919 


n ifin 

U. IOU 


1 398 

1 . <J£.U 






15 


OV!0 

z4o 


OA 
Z4 












71 

r i 


255 


17 












30 


258 


21 












39 


259 


20 












58 


261 


18 












40 


265 


20 








7 


290-342 


72 


290 


17 


0.208 


0.160 


1.300 






48 


293 


19 












31 


298 


21 












73 


301 


17 












18 


305 


23 












6 


308 


27 












10 


312 


25 












19 


316 


23 












28 


319 


22 
Prediction of clusters for PRAME (SYFPEITHI algorithm) (continued)

Total AAs: 509
Total 9-mers: 501
SYFPEITHI >= 17: 80 9-mers

                    Epitope  Start     SYFPEITHI  Epitopes/AA
Cluster #  AAs      Rank     Position  Score      Cluster  Whole Pr.  Ratio
                    41       326       20
                    74       334       17
8          343-363  59       343       18         0.238    0.160      1.491
                    60       348       18
                    75       351       17
                    20       353       23
                    76       355       17
9          364-447  49       364       19         0.250    0.160      1.566
                    32       371       21
                    11       372       25
                    61       375       18
                    77       382       17
                    21       390       23
                    78       391       17
                    1        394       30
                    42       397       20
                    62       403       18
                    33       410       21
                    43       418       20
                    34       419       21
                    7        422       27
                    2        425       29
                    79       426       17
                    63       428       18
                    64       431       18
                    12       432       25
                    16       435       24
                    80       439       17
10         455-474  29       455       22         0.200    0.160      1.253
                    17       459       24
                    4        462       28
                    3        466       29
Table 57

Prediction of clusters for CEA (NIH algorithm)

Total AAs: 702
Total 9-mers: 694
NIH >= 5: 30 9-mers

                    Peptides  Start                Peptides/AAs
Cluster #  AA       Rank      Position  Score      Cluster  Whole Pr.  Ratio
1          17-32    5         17        79.041     0.188    0.043      4.388
                    7         18        46.873
                    20        24        12.668
2          113-129  2         113       167.991    0.118    0.043      2.753
                    15        121       21.362
3          172-187  25        172       9.165      0.125    0.043      2.925
                    14        179       27.995
4          278-291  30        278       5.818      0.143    0.043      3.343
                    17        283       19.301
5          350-365  9         350       43.075     0.125    0.043      2.925
                    12        357       27.995
6          528-543  8         528       43.075     0.125    0.043      2.925
                    13        535       27.995
7          631-645  23        631       9.563      0.200    0.043      4.680
                    19        634       13.381
                    24        637       9.245
8          691-702  1         691       196.407    0.167    0.043      3.900
                    27        694       7.769
Table 58

Prediction of clusters for CEA (SYFPEITHI algorithm)

Total AAs: 702
Total 9-mers: 694
SYFPEITHI >= 16: 81 9-mers

                    Peptides  Start               Peptides/AAs
Cluster #  AA       Rank      Position  Score     Cluster  Whole Pr.  Ratio
1          5-36     67        5         16        0.250    0.117      2.140
                    23        12        19
                    24        16        19
                    9         17        22
                    25        18        19
                    32        19        18
                    68        23        16
                    33        28        18
2          37-62    41        37        17        0.269    0.117      2.305
                    20        44        20
                    26        45        19
                    42        46        17
                    27        50        19
                    43        53        17
                    44        54        17
3          99-115   14        99        21        0.235    0.117      2.014
                    5         100       23
                    45        104       17
                    34        107       18
4          116-129  69        116       16        0.143    0.117      1.223
                    21        121       20
5          172-187  46        172       17        0.125    0.117      1.070
                    70        179       16
6          192-202  3         192       24        0.182    0.117      1.557
                    47        194       17
7          226-241  48        226       17        0.188    0.117      1.605
                    49        229       17
                    15        233       21
8          307-318  11        307       22        0.250    0.117      2.140
                    71        308       16
                    51        310       17
9          319-349  52        319       17        0.129    0.117      1.105
                    53        327       17
                    72        335       16
                    35        341       18
10         370-388  12        370       22        0.211    0.117      1.802
                    54        372       17
                    74        375       16
                    6         380       23
11         403-419  56        403       17        0.235    0.117      2.014
                    57        404       17
                    58        407       17
                    28        411       19
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19 


497-449 


*/w 


427 


17 


0.188 


0.117 


1.605 






75 

I w 


432 


16 












76 

r w 


434 


16 








1^ 
I O 


twV~*Tw^ 


77 


450 

• WW 


16 


0.154 


0.117 


1.317 


A 




13 


454 


22 








14 




36 


488 


18 

1 w 


0.167 


0.117 


1.427 






18 


4Q9 


21 

4m 1 














4Q7 

•Tw f 


17 








1^ 




4 


548 


24 


0 182 


0.117 


1.557 






61 


550 

www 


17 








1fi 




62 


565 

www 


17 


0.154 


0.117 


1.317 








569 

www 


21 








i «1"7 
1 f 


of y-oy r 


7ft 


57Q 


16 

1 w 


0 143 

W. 1 1 W 


0 117 

W. Ill 


1 223 






f y 


589 


16 












7 


580 

DO 57 


23 








<i Q 

1o 


eric ciq 


z 


DUO 


95 


0 143 

Vi. ltd 


0 117 

W > 1 1 I 


1 223 






^8 


610 

U IV/ 


18 










ooi-ooy 


zy 


DO I 


19 

1 57 


0 154 

W.I w"T 


0 117 

W. 1 1 I 


1.317 


t 




DO 


R^7 
DO/ 


17 












ou 


R44 


16 

1 w 












R4 
D*r 


R59 
Dw£ 


17 

1 r 












oy 


PAH 
DDU 


18 












ft1 
O 1 


DO 1 


16 








20         675-702   22    675    20       0.286    0.117      2.446
                     30    683    19
                     31    687    19
                     40    688    18
                     65    690    17
                     1     691    31
                     66    692    17
                     8     694    23
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Table 59 _ v 

Prediction of clusters for SCP-1 (NIH algorithm)

Total AAs: 976 
Total 9-mers: 968 
NIH 5: 37 9-mers 



                     Peptides                 Peptides/AAs
Cluster #  AA        Rank  Start  Score    Cluster  Whole Pr.  Ratio
1          101-116   15    101    40.589   0.125    0.038      3.270
                     13    108    57.255
2*         281-305   14    281    44.944   0.12     0.038      3.139
                     24    288    15.203
                     17    297
3          431-447   8     431    80.217   0.073    0.038      1.914
                     26    438    11.861
                     4     439    148.896
4          557-579   11    557    64.335   0.174    0.038      4.550
                     19    560    24.937
                     6     564    87.586
                     18    571    32.765
5          635-650   10    635    69.552   0.125    0.038      3.270
                     34    642    6.542
6          755-767   36    755    5.599    0.154    0.038      4.025
                     35    759    5.928
7          838-854   2     838    284.517  0.118    0.038      3.078
                     28    846    11.426
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Table 60 

Prediction of clusters for SCP-1

Total AAs: 976
Total 9-mers: 968
Rammensee 16: 118 9-mers
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22 896-913 117 


896 16 


0.222 


0.121 


1.838 




98 


899 17 










26 


902 22 










76 


905 18 










  1 MLLAVLYCLL WSFQTSAGHF PRACVSSKNL MEKECCPPWS GDRSPCGQLS GRGSCQNILL
 61 SNAPLGPQFP FTGVDDRESW PSVFYNRTCQ CSGNFMGFNC GNCKFGFWGP NCTERRLLVR
121 RNIFDLSAPE KDKFFAYLTL AKHTISSDYV IPIGTYGQMK NGSTPMFNDI NIYDLFVWMH
181 YYVSMDALLG GSEIWRDIDF AHEAPAFLPW HRLFLLRWEQ EIQKLTGDEN FTIPYWDWRD
241 AEKCDICTDE YMGGQHPTNP NLLSPASFFS SWQIVCSRLE EYNSHQSLCN GTPEGPLRRN
301 PGNHDKSRTP RLPSSADVEF CLSLTQYESG SMDKAANFSF RNTLEGFASP LTGIADASQS
361 SMHNALHIYM NGTMSQVQGS ANDPIFLLHH AFVDSIFEQW LRRHRPLQEV YPEANAPIGH
421 NRESYMVPFI PLYRNGDFFI SSKDLGYDYS YLQDSDPDSF QDYIKSYLEQ ASRIWSWLLG
481 AAMVGAVLTA LLAGLVSLLC RHKRKQLPEE KQPLLMEKED YHSLYQSHL

TYROSINASE PROTEIN 



25 



  1 MNGDDAFARR PTVGAQIPEK IQKAFDDIAK YFSKEEWEKM KASEKIFYVY MKRKYEAMTK
 61 LGFKATLPPF MCNKRAEDFQ GNDLDNDPNR GNQVERPQMT FGRLQGISPK IMPKKPAEEG
121 NDSEEVPEAS GPQNDGKELC PPGKPTTSEK IHERSGPKRG EHAWTHRLRE RKQLVIYEEI
181 SDPEEDDE

SSX-2 PROTEIN 

40 



-146- 



WO 2003/008537 



PCT/US2002/010189 



  1 MWNLLHETDS AVATARRPRW LCAGALVLAG GFFLLGFLFG WFIKSSNEAT NITPKHNMKA
 61 FLDELKAENI KKFLYNFTQI PHLAGTEQNF QLAKQIQSQW KEFGLDSVEL AHYDVLLSYP
121 NKTHPNYISI INEDGNEIFN TSLFEPPPPG YENVSDIVPP FSAFSPQGMP EGDLVYVNYA
181 RTEDFFKLER DMKINCSGKI VIARYGKVFR GNKVKNAQLA GAKGVILYSD PADYFAPGVK
241 SYPDGWNLPG GGVQRGNILN LNGAGDPLTP GYPANEYAYR RGIAEAVGLP SIPVHPIGYY
301 DAQKLLEKMG GSAPPDSSWR GSLKVPYNVG PGFTGNFSTQ KVKMHIHSTN EVTRIYNVIG
361 TLRGAVEPDR YVILGGHRDS WVFGGIDPQS GAAVVHEIVR SFGTLKKEGW RPRRTILFAS
421 WDAEEFGLLG STEWAEENSR LLQERGVAYI NADSSIEGNY TLRVDCTPLM YSLVHNLTKE
481 LKSPDEGFEG KSLYESWTKK SPSPEFSGMP RISKLGSGND FEVFFQRLGI ASGRARYTKN
541 WETNKFSGYP LYHSVYETYE LVEKFYDPMF KYHLTVAQVR GGMVFELANS IVLPFDCRDY
601 AVVLRKYADK IYSISMKHPQ EMKTYSVSFD SLFSAVKNFT EIASKFSERL QDFDKSNPIV
661 LRMMNDQLMF LERAFIDPLG LPDRPFYRHV IYAPSSHNKY AGESFPGIYD ALFDIESKVD
721 PSKAWGEVKR QIYVAAFTVQ AAAETLSEVA

PSMA PROTEIN 



1: NM_000372 Homo sapiens tyrosinase (oculocutaneous albinism IA) (TYR), mRNA

LOCUS       NM_000372    1964 bp    mRNA            PRI       31-OCT-2000
DEFINITION  Homo sapiens tyrosinase (oculocutaneous albinism IA) (TYR), mRNA.
ACCESSION   NM_000372
VERSION     NM_000372.1  GI:4507752
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1964)
  AUTHORS   Kwon BS, Haq AK, Pomerantz SH and Halaban R.
  TITLE     Isolation and sequence of a cDNA clone for human tyrosinase that
            maps at the mouse c-albino locus
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 84 (21), 7473-7477 (1987)
  MEDLINE   88041128
  PUBMED    2823263
  REMARK    Erratum:[[published erratum appears in Proc Natl Acad Sci U S A
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10 



            1988 Sep;85(17):6352]]
REFERENCE   2  (bases 1 to 1964)
  AUTHORS   Barton DE, Kwon BS and Francke U.
  TITLE     Human tyrosinase gene, mapped to chromosome 11 (q14----q21),
            defines second region of homology with mouse chromosome 7
  JOURNAL   Genomics 3 (1), 17-24 (1988)
  MEDLINE   89122007
  PUBMED    3146546
REFERENCE   3  (bases 181 to 1964)
  AUTHORS   Shibahara,S., Tomita,Y., Tagami,H., Muller,R.M. and Cohen,T.
  TITLE     Molecular basis for the heterogeneity of human tyrosinase
  JOURNAL   Tohoku J. Exp. Med. 156 (4), 403-414 (1988)
  MEDLINE   89222868
REFERENCE   4  (bases 1 to 1964)
  AUTHORS   Bouchard B, Fuller BB, Vijayasaradhi S and Houghton AN.
  TITLE     Induction of pigmentation in mouse fibroblasts by expression of
            human tyrosinase cDNA
  JOURNAL   J. Exp. Med. 169 (6), 2029-2042 (1989)
  MEDLINE   89279151
  PUBMED    2499655
REFERENCE   5  (bases 1 to 1964)
  AUTHORS   Takeda,A., Tomita,Y., Okinaga,S., Tagami,H. and Shibahara,S.
  TITLE     Functional analysis of the cDNA encoding human tyrosinase precursor
  JOURNAL   Biochem. Biophys. Res. Commun. 162 (3), 984-990 (1989)
  MEDLINE   89351001
REFERENCE   6  (bases 1 to 1964)
  AUTHORS   Kikuchi H, Miura H, Yamamoto H, Takeuchi T, Dei T and Watanabe M.
  TITLE     Characteristic sequences in the upstream region of the human
            tyrosinase gene
  JOURNAL   Biochim. Biophys. Acta 1009 (3), 283-286 (1989)
  MEDLINE   90089403
  PUBMED    2480811
REFERENCE   7  (bases 1 to 1964)
  AUTHORS   Giebel LB, Strunk KM and Spritz RA.
  TITLE     Organization and nucleotide sequences of the human tyrosinase gene
            and a truncated tyrosinase-related segment
  JOURNAL   Genomics 9 (3), 435-445 (1991)
  MEDLINE   91236163
  PUBMED    1903356
REFERENCE   8  (bases 1 to 1964)
  AUTHORS   Brichard V, Van Pel A, Wolfel T, Wolfel C, De Plaen E, Lethe B,
            Coulie P and Boon T.
  TITLE     The tyrosinase gene codes for an antigen recognized by autologous
            cytolytic T lymphocytes on HLA-A2 melanomas
  JOURNAL   J. Exp. Med. 178 (2), 489-495 (1993)
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MEDLINE 93340625 
PUBMED 8340755 
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from M27160.1.
FEATURES             Location/Qualifiers
     source          1..1964
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="11"
                     /map="11q14-q21"
                     /cell_line="MeWo melanoma DNA, and cDNA to mRNA"
                     /tissue_type="placenta"
     gene            1..1964
                     /gene="TYR"
                     /note="OCAIA"
                     /db_xref="LocusID:7299"
                     /db_xref="MIM:203100"
     CDS             83..1672
                     /gene="TYR"
                     /EC_number="1.14.18.1"
                     /note="Tyrosinase"
                     /codon_start=1
                     /db_xref="LocusID:7299"
                     /db_xref="MIM:203100"
                     /product="tyrosinase (oculocutaneous albinism IA)"
                     /protein_id="NP_000363.1"
                     /db_xref="GI:4507753"
                     /translation="MLLAVLYCLLWSFQTSAGHFPRACVSSKNLMEKECCPPWSGDRS
                     PCGQLSGRGSCQNILLSNAPLGPQFPFTGVDDRESWPSVFYNRTCQCSGNFMGFNCGN
                     CKFGFWGPNCTERRLLVRRNIFDLSAPEKDKFFAYLTLAKHTISSDYVIPIGTYGQMK
                     NGSTPMFNDINIYDLFVWMHYYVSMDALLGGSEIWRDIDFAHEAPAFLPWHRLFLLRW
                     EQEIQKLTGDENFTIPYWDWRDAEKCDICTDEYMGGQHPTNPNLLSPASFFSSWQIVC
                     SRLEEYNSHQSLCNGTPEGPLRRNPGNHDKSRTPRLPSSADVEFCLSLTQYESGSMDK
                     AANFSFRNTLEGFASPLTGIADASQSSMHNALHIYMNGTMSQVQGSANDPIFLLHHAF
                     VDSIFEQWLRRHRPLQEVYPEANAPIGHNRESYMVPFIPLYRNGDFFISSKDLGYDYS
                     YLQDSDPDSFQDYIKSYLEQASRIWSWLLGAAMVGAVLTALLAGLVSLLCRHKRKQLP
                     EEKQPLLMEKEDYHSLYQSHL"
     sig_peptide     83..136
     mat_peptide     137..1669
                     /note="tyrosinase"
                     /product="tyrosinase (oculocutaneous albinism IA)"
     misc_feature    560..1429
                     /note="tyrosinase; Region: Common central domain of
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tyrosinase 
BASE COUNT 520 a 462 c 

ORIGIN 

1 atcactgtag tagtagctgg 
5 ttcctgcaga 

61 ccttgtgagg actagaggaa 
tgtggagttt 

121 ccagacctcc gctggccatt 
tgatggagaa 
10 181 ggaatgctgt ccaccgtgga 

caggcagagg 

241 ttcctgtcag aatatccttc 

ccttcacagg 

301 ggtggatgac cgggagtcgt 

15 agtgctctgg 

361 caacttcatg ggattcaact 

caaactgcac 

421 agagagacga ctcttggtga 

agaaggacaa 
20 481 attttttgcc tacctcactt 

tcatccccat 

541 agggacctat ggccaaatga 
tcaatattta 

601 tgacctcttt gtctggatgc 
25 ggggatctga 

661 aatctggaga gacattgatt 
ggcatagact 

721 cttcttgttg cggtgggaac 
acttcactat 
30 781 tccatattgg gactggcggg 

agtacatggg 

841 aggtcagcac cccacaaatc 
cctcttggca 

901 gattgtctgt agccgattgg 
35 atggaacgcc 

961 cgagggacct ttacggcgta 
caaggctccc 

1021 ctcttcagct gatgtagaat 
gttccatgga 
40 1081 taaagctgcc aatttcagct 

cacttactgg 

1141 gatagcggat gcctctcaaa 
tgaatggaac 

1201 aatgtcccag gtacagggat 
45 atgcatttgt 

1261 tgacagtatt tttgagcagt 
tttatccaga 

1321 agccaatgca cccattggac 
taccactgta 
50 1381 cagaaatggt gatttcttta 

gctatctaca 

1441 agattcagac ccagactctt 
aagcgagtcg 

1501 gatctggtca tggctccttg 
55 ccctgctggc 

1561 agggcttgtg agcttgctgt 
aaaagcagcc 






445 g 


537 t 




aaagagaaat 


ctgtgactcc 


aattagccag 


gaatgctcct 


ggctgttttg 


tactgcctgc 

* 


tccctagagc 


ctgtgtctcc 


tctaagaacc 


qcgqqqacag 


gagtccctgt 


ggccagcttt 


tgtccaatgc 


accacttggg 


cctcaatttc 


qqccttccqt 


cttttataat 


aggacctgcc 


qtqqaaactq 


caagtttggc 


ttttggggac 


gaagaaacat 


cttcgatttg 


agtgccccag 


tagcaaagca 


taccatcagc 


tcagactatg 


aaaatgqatc 


aacacccatg 

few 


tttaacgaca 


attattatgt 


gtcaatggat 


geactgettg 

•mJ mm* — J 


ttgcccatga 


agcaccagct 


tttctgeett 


aagaaatcca 


gaagctgaca 


ggagatgaaa 


atqcaqaaaa 


gtgtgacatt 


tgcacagatg 


ctaacttact 


cagcccagca 


tcattcttct 


aqqaqtacaa 


cage cat cag 


tetttatgea 


atcctQQaaa 


ccatgacaaa 


tccagaaccc 


tttQCCtQacr 

W W W J3 ^ rj ^* J 


tttcracccaa 


tatgaatctg 


t~ h aaaaa t ac 


ac t ocraacrcra 


tttqctagtc 


acacrcatcrca 


caatQccttq 


cacatctata 


ctcrccaacqa 


tcctatcttc 


cttcttcacc 


qqctccgaag 


gcaccgtcct 


cttcaagaag 


ataaccggga 


atcctacatg 


gttcctttta 


tttcatccaa 


agatctgggc 


tatgactata 


ttcaagacta 


cattaagtcc 


tatttggaac 


gggcggcgat 


ggtaggggee 


gtcctcactg 


gtcgtcacaa 


gagaaagcag 


cttcctgaag 






1621 actcctcatg gagaaagagg 
aaaaggctta 

1681 ggcaatagag tagggccaaa 
gtccaggttc 
5 1741 ccagagaata tctgctggta 

aacctaatac 

1801 aaagtgtagc cttcttccaa 
ctgttttcac 

1861 tcagcccttt taacattttc 
LO ctatttggta 

1921 atgaggaact gttatttgta 



attaccacag cttgtatcag agccatttat 
aagcctgacc tcactctaac tcaaagtaat 
tttttctgta aagaccattt gcaaaattgt 
ctcaggtaga acacacctgt ctttgtcttg 
ccctaagccc atatgtctaa ggaaaggatg 
tgtgaattaa agtgctctta tttt 



1: NM_003147 Homo sapiens synovial sarcoma, X breakpoint 2 (SSX2), mRNA

LOCUS       NM_003147    766 bp    mRNA            PRI       14-MAR-2001
DEFINITION  Homo sapiens synovial sarcoma, X breakpoint 2 (SSX2), mRNA.
ACCESSION   NM_003147
VERSION     NM_003147.1  GI:10337582
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 766)
  AUTHORS   Shipley JM, Clark J, Crew AJ, Birdsall S, Rocques PJ, Gill S,
            Chelly J, Monaco AP, Abe S, Gusterson BA and et al.
  TITLE     The t(X;18)(p11.2;q11.2) translocation found in human synovial
            sarcomas involves two distinct loci on the X chromosome
  JOURNAL   Oncogene 9 (5), 1447-1453 (1994)
  MEDLINE   94203675
  PUBMED    8152806
REFERENCE   2  (bases 1 to 766)
  AUTHORS   Crew,A.J., Clark,J., Fisher,C., Gill,S., Grimer,R., Chand,A.,
            Shipley,J., Gusterson,B.A. and Cooper,C.S.
  TITLE     Fusion of SYT to two genes, SSX1 and SSX2, encoding proteins with
            homology to the Kruppel-associated box in human synovial sarcoma
  JOURNAL   EMBO J. 14 (10), 2333-2340 (1995)
  MEDLINE   95292974
REFERENCE   3  (bases 1 to 766)
  AUTHORS   Tureci O, Sahin U, Schobert I, Koslowski M, Scmitt H, Schild HJ,
            Stenner F, Seitz G, Rammensee HG and Pfreundschuh M.
  TITLE     The SSX-2 gene, which is involved in the t(X;18) translocation of
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synovial sarcomas, codes for the human tumor antigen 

HOM-MEL-40 

  JOURNAL   Cancer Res. 56 (20), 4766-4772 (1996)
  MEDLINE   96438636
  PUBMED    8840996
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from X86175.1.
FEATURES             Location/Qualifiers
     source          1..766
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="X"
                     /map="Xp11.23-p11.22"
                     /dev_stage="adult"
     gene            1..766
                     /gene="SSX2"
                     /note="HD21; HOM-MEL-40; SSX"
                     /db_xref="LocusID:6757"
                     /db_xref="MIM:300192"
     misc_feature    20..61
                     /note="Kruppel associated box homology"
     CDS             92..658
                     /gene="SSX2"
                     /codon_start=1
                     /db_xref="LocusID:6757"
                     /db_xref="MIM:300192"
                     /product="synovial sarcoma, X breakpoint 2"
                     /protein_id="NP_003138.1"
                     /db_xref="GI:10337583"
                     /translation="MNGDDAFARRPTVGAQIPEKIQKAFDDIAKYFSKEEWEKMKASE
                     KIFYVYMKRKYEAMTKLGFKATLPPFMCNKRAEDFQGNDLDNDPNRGNQVERPQMTFG
                     RLQGISPKIMPKKPAEEGNDSEEVPEASGPQNDGKELCPPGKPTTSEKIHERSGPKRG
                     EHAWTHRLRERKQLVIYEEISDPEEDDE"
     misc_feature    161..337
                     /note="KRAB; Region: krueppel associated box"
BASE COUNT      229 a    181 c    200 g    156 t
ORIGIN

  1 ctctctttcg attcttccat actcagagta cgcacggtct gattttctct ttggattctt
 61 ccaaaatcag agtcagactg ctcccggtgc catgaacgga gacgacgcct ttgcaaggag
121 acccacggtt ggtgctcaaa taccagagaa gatccaaaag gccttcgatg atattgccaa
181 atacttctct aaggaagagt gggaaaagat gaaagcctcg gagaaaatct tctatgtgta
241 tatgaagaga aagtatgagg ctatgactaa actaggtttc aaggccaccc tcccaccttt
301 catgtgtaat aaacgggccg aagacttcca ggggaatgat ttggataatg accctaaccg
361 tgggaatcag gttgaacgtc ctcagatgac tttcggcagg ctccagggaa tctccccgaa
421 gatcatgccc aagaagccag cagaggaagg aaatgattcg gaggaagtgc cagaagcatc
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10 



481 tggcccacaa aatgatggga aagagctgtg ccccccggga aaaccaacta cctctgagaa
541 gattcacgag agatctggac ccaaaagggg ggaacatgcc tggacccaca gactgcgtga
601 gagaaaacag ctggtgattt atgaagagat cagcgaccct gaggaagatg acgagtaact
661 cccctcaggg atacgacaca tgcccatgat gagaagcaga acgtggtgac ctttcacgaa
721 catgggcatg gctgcggacc cctcgtcatc aggtgcatag caagtg
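
The relationship between the ORIGIN nucleotide sequence and the /translation qualifier of the SSX2 record can be checked with a short sketch such as the one below. It assumes the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration and are not part of the record. The input is bases 92 to 121 of the sequence listed above, the first ten codons of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order
# (third base varies fastest).
BASES = "tcag"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(seq):
    """Translate a nucleotide string in frame, stopping at the first stop codon.

    Whitespace (as in the 10-base blocks of an ORIGIN listing) is ignored.
    """
    seq = "".join(seq.lower().split())
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Bases 92-121 of the SSX2 mRNA (start of the CDS at position 92).
print(translate("atgaacgga gacgacgcct ttgcaaggag a"))  # MNGDDAFARR
```

The result matches the first ten residues of the /translation qualifier, which is consistent with the CDS location 92..658 given in the record.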



1: NM_004476 Homo sapiens folate hydrolase (prostate-specific membrane antigen) 1 (FOLH1), mRNA

LOCUS       NM_004476    2653 bp    mRNA            PRI       01-NOV-2000
DEFINITION  Homo sapiens folate hydrolase (prostate-specific membrane antigen)
            1 (FOLH1), mRNA.
ACCESSION   NM_004476
VERSION     NM_004476.1  GI:4758397
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2653)
  AUTHORS   Israeli,R.S., Powell,C.T., Fair,W.R. and Heston,W.D.
  TITLE     Molecular cloning of a complementary DNA encoding a
            prostate-specific membrane antigen
  JOURNAL   Cancer Res. 53 (2), 227-230 (1993)
  MEDLINE   93113576
REFERENCE   2  (bases 1 to 2653)
  AUTHORS   Rinker-Schaeffer CW, Hawkins AL, Su SL, Israeli RS, Griffin CA,
            Isaacs JT and Heston WD.
  TITLE     Localization and physical mapping of the prostate-specific membrane
            antigen (PSM) gene to human chromosome 11
  JOURNAL   Genomics 30 (1), 105-108 (1995)
  MEDLINE   96129312
  PUBMED    8595888
REFERENCE   3  (bases 1 to 2653)
  AUTHORS   O'Keefe DS, Su SL, Bacich DJ, Horiguchi Y, Luo Y, Powell CT,
            Zandvliet D, Russell PJ, Molloy PL, Nowak NJ, Shows TB, Mullins C,
            Vonder Haar RA, Fair WR and Heston WD.
  TITLE     Mapping, genomic organization and promoter analysis of the human
            prostate-specific membrane antigen gene
  JOURNAL   Biochim. Biophys. Acta 1443 (1-2), 113-127 (1998)
  MEDLINE   99057588
  PUBMED    9838072
REFERENCE   4  (bases 1 to 2653)
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AUTHORS Maraj BH, Leek JP, Karayi M, Ali M, Lench NJ and 
Markham AF. 

  TITLE     Detailed genetic mapping around a putative prostate-specific
            membrane antigen locus on human chromosome 11p11.2
  JOURNAL   Cytogenet. Cell Genet. 81 (1), 3-9 (1998)
  MEDLINE   98358137
  PUBMED    9691167
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from M99487.1.
FEATURES             Location/Qualifiers
     source          1..2653
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="11"
                     /map="11p11.2"
                     /sex="male"
                     /cell_line="LNCaP-ATCC"
                     /cell_type="prostate"
                     /tissue_type="prostatic carcinoma metastatic lymph node"
                     /tissue_lib="LNCaP cDNA of Ron Israeli"
     gene            1..2653
                     /gene="FOLH1"
                     /note="FOLH; PSM; PSMA"
                     /db_xref="LocusID:2346"
                     /db_xref="MIM:600934"
     CDS             262..2514
                     /gene="FOLH1"
                     /note="folate hydrolase 1 (prostate-specific membrane
                     antigen)"
                     /codon_start=1
                     /db_xref="LocusID:2346"
                     /db_xref="MIM:600934"
                     /evidence=experimental
                     /product="folate hydrolase (prostate-specific membrane
                     antigen) 1"
                     /protein_id="NP_004467.1"
                     /db_xref="GI:4758398"
                     /translation="MWNLLHETDSAVATARRPRWLCAGALVLAGGFFLLGFLFGWFIK
                     SSNEATNITPKHNMKAFLDELKAENIKKFLYNFTQIPHLAGTEQNFQLAKQIQSQWKE
                     FGLDSVELAHYDVLLSYPNKTHPNYISIINEDGNEIFNTSLFEPPPPGYENVSDIVPP
                     FSAFSPQGMPEGDLVYVNYARTEDFFKLERDMKINCSGKIVIARYGKVFRGNKVKNAQ
                     LAGAKGVILYSDPADYFAPGVKSYPDGWNLPGGGVQRGNILNLNGAGDPLTPGYPANE
                     YAYRRGIAEAVGLPSIPVHPIGYYDAQKLLEKMGGSAPPDSSWRGSLKVPYNVGPGFT
                     GNFSTQKVKMHIHSTNEVTRIYNVIGTLRGAVEPDRYVILGGHRDSWVFGGIDPQSGA
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AWHBIVRSFGTLKKEGWRPRRTILFASWDAEEFGLLGSTEWAEENSRLLQERGVAYI 
                     NADSSIEGNYTLRVDCTPLMYSLVHNLTKELKSPDEGFEGKSLYESWTKKSPSPEFSG
                     MPRISKLGSGNDFEVFFQRLGIASGRARYTKNWETNKFSGYPLYHSVYETYELVEKFY
                     DPMFKYHLTVAQVRGGMVFELANSIVLPFDCRDYAVVLRKYADKIYSISMKHPQEMKT
                     YSVSFDSLFSAVKNFTEIASKFSERLQDFDKSNPIVLRMMNDQLMFLERAFIDPLGLP
                     DRPFYRHVIYAPSSHNKYAGESFPGIYDALFDIESKVDPSKAWGEVKRQIYVAAFTVQ
                     AAAETLSEVA"
     misc_feature    778..1029
                     /note="PA; Region: PA domain"
BASE COUNT      782 a    524 c    640 g    707 t
ORIGIN

   1 ctcaaaaggg gccggatttc cttctcctgg aggcagatgt tgcctctctc tctcgctcgg
  61 attggttcag tgcactctag aaacactgct gtggtggaga aactggaccc caggtctgga
 121 gcgaattcca gcctgcaggg ctgataagcg aggcattagt gagattgaga gagactttac
 181 cccgccgtgg tggttggagg gcgcgcagta gagcagcagc acaggcgcgg gtcccgggag
 241 gccggctctg ctcgcgccga gatgtggaat ctccttcacg aaaccgactc ggctgtggcc
 301 accgcgcgcc gcccgcgctg gctgtgcgct ggggcgctgg tgctggcggg tggcttcttt
 361 ctcctcggct tcctcttcgg gtggtttata aaatcctcca atgaagctac taacattact
 421 ccaaagcata atatgaaagc atttttggat gaattgaaag ctgagaacat caagaagttc
 481 ttatataatt ttacacagat accacattta gcaggaacag aacaaaactt tcagcttgca
 541 aagcaaattc aatcccagtg gaaagaattt ggcctggatt ctgttgagct agcacattat
 601 gatgtcctgt tgtcctaccc aaataagact catcccaact acatctcaat aattaatgaa
 661 gatggaaatg agattttcaa cacatcatta tttgaaccac ctcctccagg atatgaaaat
 721 gtttcggata ttgtaccacc tttcagtgct ttctctcctc aaggaatgcc agagggcgat
 781 ctagtgtatg ttaactatgc acgaactgaa gacttcttta aattggaacg ggacatgaaa
 841 atcaattgct ctgggaaaat tgtaattgcc agatatggga aagttttcag aggaaataag
 901 gttaaaaatg cccagctggc aggggccaaa ggagtcattc tctactccga ccctgctgac
 961 tactttgctc ctggggtgaa gtcctatcca gatggttgga atcttcctgg aggtggtgtc
1021 cagcgtggaa atatcctaaa tctgaatggt gcaggagacc ctctcacacc aggttaccca
1081 gcaaatgaat atgcttatag gcgtggaatt gcagaggctg ttggtcttcc aagtattcct
1141 gttcatccaa ttggatacta tgatgcacag aagctcctag aaaaaatggg tggctcagca
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1201 ccaccagata gcagctggag 
     aggaagtctc aaagtgccct acaatgttgg acctggcttt
1261 actggaaact tttctacaca aaaagtcaag atgcacatcc actctaccaa tgaagtgaca
1321 agaatttaca atgtgatagg tactctcaga ggagcagtgg aaccagacag atatgtcatt
1381 ctgggaggtc accgggactc atgggtgttt ggtggtattg accctcagag tggagcagct
1441 gttgttcatg aaattgtgag gagctttgga acactgaaaa aggaagggtg gagacctaga
1501 agaacaattt tgtttgcaag ctgggatgca gaagaatttg gtcttcttgg ttctactgag
1561 tgggcagagg agaattcaag actccttcaa gagcgtggcg tggcttatat taatgctgac
1621 tcatctatag aaggaaacta cactctgaga gttgattgta caccgctgat gtacagcttg
1681 gtacacaacc taacaaaaga gctgaaaagc cctgatgaag gctttgaagg caaatctctt
1741 tatgaaagtt ggactaaaaa aagtccttcc ccagagttca gtggcatgcc caggataagc
1801 aaattgggat ctggaaatga ttttgaggtg ttcttccaac gacttggaat tgcttcaggc
1861 agagcacggt atactaaaaa ttgggaaaca aacaaattca gcggctatcc actgtatcac
1921 agtgtctatg aaacatatga gttggtggaa aagttttatg atccaatgtt taaatatcac
1981 ctcactgtgg cccaggttcg aggagggatg gtgtttgagc tagccaattc catagtgctc
2041 ccttttgatt gtcgagatta tgctgtagtt ttaagaaagt atgctgacaa aatctacagt
2101 atttctatga aacatccaca ggaaatgaag acatacagtg tatcatttga ttcacttttt
2161 tctgcagtaa agaattttac agaaattgct tccaagttca gtgagagact ccaggacttt
2221 gacaaaagca acccaatagt attaagaatg atgaatgatc aactcatgtt tctggaaaga
2281 gcatttattg atccattagg gttaccagac aggccttttt ataggcatgt catctatgct
2341 ccaagcagcc acaacaagta tgcaggggag tcattcccag gaatttatga tgctctgttt
2401 gatattgaaa gcaaagtgga cccttccaag gcctggggag aagtgaagag acagatttat
2461 gttgcagcct tcacagtgca ggcagctgca gagactttga gtgaagtagc ctaagaggat
2521 tctttagaga atccgtattg aatttgtgtg gtatgtcact cagaaagaat cgtaatgggt
2581 atattgataa attttaaaat tggtatattt gaaataaagt tgaatattat atataaaaaa
2641 aaaaaaaaaa aaa
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U20093. Human melanocyte-. . . [gi: 1142634] 

LOCUS       HSPMEL17S2   2817 bp    DNA             PRI       20-NOV-1996
DEFINITION  Human melanocyte-specific (pmel 17) gene, exons 2-5, and complete
            cds.
ACCESSION   U20093
VERSION     U20093.1  GI:1142634
KEYWORDS
SEGMENT     2 of 2
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2817)
  AUTHORS   Kim,K.K., Youn,B.S., Heng,H.H., Shi,X.M., Tsui,L.C., Lee,Z.H.,
            Pickard,R.T. and Kwon,B.S.
  TITLE     Genomic organization and FISH mapping of human Pmel 17, the
            putative silver locus
  JOURNAL   Pigment Cell Res. 9 (1), 42-48 (1996)
  MEDLINE   96314705
REFERENCE   2  (bases 1 to 2817)
  AUTHORS   Kwon,B.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-JAN-1995) Indiana University School of Medicine,
            Microbiology and Immunology, 635 Barnhill Drive, Indianapolis, IN
            46202, USA
FEATURES             Location/Qualifiers
     source          1..2817
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="12"
                     /map="12pter-q21"
                     /clone_lib="lambda FIX II from Stratagene"
     gene            join(U19491.1:1111..3109,1..2722)
                     /gene="pmel 17"
     CDS             join(U19491.1:1111..1186,67..459,853..1014,1266..2396,
                     2499..2722)
                     /gene="pmel 17"
                     /codon_start=1
                     /product="Pmel 17"
                     /protein_id="AAB19181.1"
                     /db_xref="GI:1142636"

/translation= tt MDLVLKRCLLHLAVIGALIAVGATKVPRNQDWLGVSRQLRTKAWNRQLYPEW 
TEAQRLDCWRGGQVSLKVSNDGPTLIGJ^ASFSIAl^FPGSQKVLP^^ 

QPVYPQETDDACIFPDGGPCPSGSWSQKRSFVYWKTWGQYWQVLGGPVSGLSIGTGRAMLGTHTM 
55 EVTVYHI^GSRSYVPLAHSSSAFTITDQVPFSVSVSQLRALDGGNKHFLRNQPLTFALQLHDPSGY 
LAEADLSYTWDFGDSSGTLI SRAPWTHTYLE PGPVTAQWLQAAI PLTSCGSSPVPGTTDGHRPT 
AEAPNTTAGQVPTTEWGTTPGQAPTAEPS GTTS VQVPTTEVI S TAPVQMPTAE STGMTPE KVPVS 
EVMGTTLAEMST PEATGMTPAEVS I WLSGTTAAQVTTTEWVETTARELPI PEPEGPDASS IMSTE 
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S ITGS LGPLLDGTATLRLVKRQVPLDCVLYRYGS FSVTLD I VQGI ESAE I LQAVPSGEGDAFELTV 
S CQGGL PKEACME I SSPGCQP PAQRLCQ PVL PS PACQLVLHQI LKGGSGTYCLNVSLADTNS LAW 
STQLIMPGQEAGLGQVPLIVGILLVLMAVVLASLIYRRRLMKQDFSVPQLPHSSSHWLRLPRIFCS 
C P I GENS PLLSGQQV" 
     exon            67..459
                     /gene="pmel 17"
                     /number=2
     exon            853..1014
                     /gene="pmel 17"
                     /number=3
     exon            1266..2396
                     /gene="pmel 17"
                     /number=4
     exon            2499..>2722
                     /gene="pmel 17"
                     /number=5
BASE COUNT      627 a    772 c    739 g    679 t
ORIGIN

   1 gtgctaaaaa gatgccttct tcatttggct gtgataggtg ctttgtggct gtgggggcta
  61 caaaagtacc cagaaaccag gactggcttg gtgtctcaag gcaactcaga accaaagcct
 121 ggaacaggca gctgtatcca gagtggacag aagcccagag acttgactgc tggagaggtg
 181 gtcaagtgtc cctcaaggtc agtaatgatg ggcctacact gattggtgca aatgcctcct
 241 tctctattgc cttgaacttc cctggaagcc aaaaggtatt gccagatggg caggttatct
 301 gggtcaacaa taccatcatc aatgggagcc aggtgtgggg aggacagcca gtgtatcccc
 361 aggaaactga cgatgcctgc atcttccctg atggtggacc ttgcccatct ggctcttggt
 421 ctcagaagag aagctttgtt tatgtctgga agacctgggg tgagggactc ccttctcagc
 481 ctatcatcca cacttgtgtt tacttctttc tacctgatca cctttctttt ggccgcccct
 541 tccaccttaa cttctgtgat tttctctaat cttcattttc ctcttagatc ttttctcttt
 601 cttagcacct agcccccttc aagctctatc ataattcttt ctggcaactc ttggcctcaa
 661 ttgtagtcct accccatgga atgcctcatt aggacccctt ccctgtcccc ccatatcaca
 721 gccttccaaa caccctcaga agtaatcata cttcctgacc tcccatctcc agtgccgttt
 781 cgaagcctgt ccctcagtcc cctttgacca gtaatctctt cttccttgct tttcattcca
 841 aaaatgcttc aggccaatac tggcaagttc tagggggccc agtgtctggg ctgagcattg
 901 ggacaggcag ggcaatgctg ggcacacaca ccatggaagt gactgtctac catcgccggg
 961 gatcccggag ctatgtgcct cttgctcatt ccagctcagc cttcaccatt actggtaagg
1021 gttcaggaag ggcaaggcca gttgtagggc aaagagaagg cagggaggct tggatggact
1081 gcaaaggaga aaggtgaaat gctgtgcaaa cttaaagtag aagggccagg aagacctagg
1141 cagagaaatg tgaggcttag tgccagtgaa gggccagcca gtcagcttgg agttggaggg
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1201 tgtggctgtg aaaggagaag 
     ctgtggctca ggcctggttc tcaccttttc tggctccaat
1261 cccagaccag gtgcctttct ccgtgagcgt gtcccagttg cgggccttgg atggagggaa
1321 caagcacttc ctgagaaatc agcctctgac ctttgccctc cagctccatg accccagtgg
1381 ctatctggct gaagctgacc tctcctacac ctgggacttt ggagacagta gtggaaccct
1441 gatctctcgg gcacctgtgg tcactcatac ttacctggag cctggcccag tcactgccca
1501 ggtggtcctg caggctgcca ttcctctcac ctcctgtggc tcctccccag ttccaggcac
1561 cacagatggg cacaggccaa ctgcagaggc ccctaacacc acagctggcc aagtgcctac
1621 tacagaagtt gtgggtacta cacctggtca ggcgccaact gcagagccct ctggaaccac
1681 atctgtgcag gtgccaacca ctgaagtcat aagcactgca cctgtgcaga tgccaactgc
1741 agagagcaca ggtatgacac ctgagaaggt gccagtttca gaggtcatgg gtaccacact
1801 ggcagagatg tcaactccag aggctacagg tatgacacct gcagaggtat caattgtggt
1861 gctttctgga accacagctg cacaggtaac aactacagag tgggtggaga ccacagctag
1921 agagctacct atccctgagc ctgaaggtcc agatgccagc tcaatcatgt ctacggaaag
1981 tattacaggt tccctgggcc ccctgctgga tggtacagcc accttaaggc tggtgaagag
2041 acaagtcccc ctggattgtg ttctgtatcg atatggttcc ttttccgtca ccctggacat
2101 tgtccagggt attgaaagtg ccgagatcct gcaggctgtg ccgtccggtg agggggatgc
2161 atttgagctg actgtgtcct gccaaggcgg gctgcccaag gaagcctgca tggagatctc
2221 atcgccaggg tgccagcccc ctgcccagcg gctgtgccag cctgtgctac ccagcccagc
2281 ctgccagctg gttctgcacc agatactgaa gggtggctcg gggacatact gcctcaatgt
2341 gtctctggct gataccaaca gcctggcagt ggtcagcacc cagcttatca tgcctggtag
2401 gtccttggac agagactaag tgaggaggga agtggataga ggggacagct ggcaagcagc
2461 agacatgagt gaagcagtgc ctgggattct tctcacaggt caagaagcag gccttgggca
2521 ggttccgctg atcgtgggca tcttgctggt gttgatggct gtggtccttg catctctgat
2581 atataggcgc agacttatga agcaagactt ctccgtaccc cagttgccac atagcagcag
2641 tcactggctg cgtctacccc gcatcttctg ctcttgtccc attggtgaga atagccccct
2701 cctcagtggg cagcaggtct gagtactctc atatgatgct gtgattttcc tggagttgac
2761 agaaacacct atatttcccc cagtcttccc tgggagacta ctattaactg aaataaa
//
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LOCUS NMJ)01648 1466 bp mRNA PRI 

31-OCT-2000 

DEFINITION Homo sapiens kallikrein 3, (prostate specific antigen) 
(KLK3) , 

5 mRNA. 
ACCESSION NM_001648 

VERSION NM_001648.1 GI : 4502172 

KEYWORDS 

SOURCE human . 

10 ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; 

Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; 

Hominidae; Homo. 
REFERENCE   1  (bases 1 to 1466)
  AUTHORS   Lundwall,A. and Lilja,H.
  TITLE     Molecular cloning of human prostate specific antigen cDNA
  JOURNAL   FEBS Lett. 214 (2), 317-322 (1987)
  MEDLINE   87190978
REFERENCE   2  (bases 1 to 1466)
  AUTHORS   Sutherland GR, Baker E, Hyland VJ, Callen DF, Close JA, Tregear GW,
            Evans BA and Richards RI.
  TITLE     Human prostate-specific antigen (APS) is a member of the glandular
            kallikrein gene family at 19q13
  JOURNAL   Cytogenet. Cell Genet. 48 (4), 205-207 (1988)
  MEDLINE   89250658
  PUBMED    2470553
REFERENCE   3  (bases 1 to 1466)
  AUTHORS   Riegman PH, Klaassen P, van der Korput JA, Romijn JC and Trapman J.
  TITLE     Molecular cloning and characterization of novel prostate antigen
            cDNA's
  JOURNAL   Biochem. Biophys. Res. Commun. 155 (1), 181-188 (1988)
  MEDLINE   88326297
  PUBMED    2458104
REFERENCE   4  (bases 1 to 1466)
  AUTHORS   Schulz P, Stucka R, Feldmann H, Combriato G, Klobeck HG and
            Fittler F.
  TITLE     Sequence of a cDNA clone encompassing the complete mature human
            prostate specific antigen (PSA) and an unspliced leader sequence
  JOURNAL   Nucleic Acids Res. 16 (13), 6226 (1988)
  MEDLINE   88289366
  PUBMED    2456523
REFERENCE   5  (bases 1 to 1466)
  AUTHORS   Riegman PH, Vlietstra RJ, van der Korput JA, Romijn JC and
            Trapman J.
  TITLE     Characterization of the prostate-specific antigen gene: a novel
            human kallikrein-like gene
  JOURNAL   Biochem. Biophys. Res. Commun. 159 (1), 95-102 (1989)
  MEDLINE   89165891
  PUBMED    2466464
REFERENCE   6  (bases 1 to 1466)
  AUTHORS   Henttu P and Vihko P.
  TITLE     cDNA coding for the entire human prostate specific antigen shows
            high homologies to the human tissue kallikrein
  JOURNAL   Biochem. Biophys. Res. Commun. 160 (2), 903-910 (1989)
  MEDLINE   89246551
  PUBMED    2470373
REFERENCE   7  (bases 1 to 1466)
  AUTHORS   Klobeck HG, Combriato G, Schulz P, Arbusow V and Fittler F.
  TITLE     Genomic sequence of human prostate specific antigen (PSA)
  JOURNAL   Nucleic Acids Res. 17 (10), 3981 (1989)
  MEDLINE   89282407
  PUBMED    2471958
REFERENCE   8  (bases 1 to 1466)
  AUTHORS   Lundwall A.
  TITLE     Characterization of the gene for prostate-specific antigen, a human
            glandular kallikrein
  JOURNAL   Biochem. Biophys. Res. Commun. 161 (3), 1151-1159 (1989)
  MEDLINE   89302090
  PUBMED    2472789
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from X05332.1.

FEATURES             Location/Qualifiers
     source          1..1466
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q13"
                     /clone="lambda HPSA-1"
                     /tissue_type="prostate"
                     /clone_lib="(lambda) gt11"
     gene            1..1466
                     /gene="KLK3"
                     /note="PSA; APS"
                     /db_xref="LocusID:354"
                     /db_xref="MIM:176820"
     sig_peptide     44..94
     CDS             44..829
                     /gene="KLK3"
                     /EC_number="3.4.21.77"
                     /note="antigen, prostate specific"
                     /codon_start=1
                     /db_xref="LocusID:354"
                     /db_xref="MIM:176820"
                     /product="kallikrein 3, (prostate specific antigen)"
                     /protein_id="NP_001639.1"
                     /db_xref="GI:4502173"
                     /translation="MWVPVVFLTLSVTWIGAAPLILSRIVGGWECEKHSQPWQVLVAS
                     RGRAVCGGVLVHPQWVLTAAHCIRNKSVILLGRHSLFHPEDTGQVFQVSHSFPHPLYDMSLLKNRF
                     LRPGDDSSHDLMLLRLSEPAELTDAVKVMDLPTQEPALGTTCYASGWGSIEPEEFLTPKKLQCVDL
                     HVISNDVCAQVHPQKVTKFMLCAGRWTGGKSTCSGDSGGPLVCNGVLQGITSWGSEPCALPERPSL
                     YTKVVHYRKWIKDTIVANP"



     variation       91
                     /allele="C"
                     /allele="T"
                     /db_xref="dbSNP:1135765"
     variation       91
                     /allele="C"
                     /allele="T"
                     /db_xref="dbSNP:11573"
     misc_feature    95..115
                     /note="propeptide (AA 1-7)"
     variation       97
                     /allele=
                     /allele="A"
                     /db_xref="dbSNP:1135766"
     misc_feature    113..802
                     /note="Tryp_SPc; Region: Trypsin-like serine protease"
     misc_feature    116..802
                     /note="trypsin; Region: Trypsin"
     mat_peptide     116..826
                     /product="kallikrein 3, (prostate specific antigen)"
     variation       280
                     /allele="C"
                     /allele="T"
                     /db_xref="dbSNP:1058072"
     variation       280
                     /allele="C"
                     /allele="T"
                     /db_xref="dbSNP:12946"
     variation       526
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:1803130"
     variation       643
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:1803129"
     variation       788
                     /note="WARNING: map location ambiguous"
                     /allele="T"
                     /allele="C"
                     /db_xref="dbSNP:1802720"
     variation       844
                     /allele="T"
                     /allele="C"
                     /db_xref="dbSNP:1058205"
     variation       894
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:1058274"
     variation       906
                     /allele="T"
                     /allele="A"
                     /db_xref="dbSNP:1803132"
     variation       918
                     /allele="T"
                     /allele="C"
                     /db_xref="dbSNP:1802722"
     variation       953
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:1802721"
     variation       1017
                     /allele="T"
                     /allele="C"
                     /db_xref="dbSNP:1803138"
     variation       1074
                     /allele="C"
                     /allele="T"
                     /db_xref="dbSNP:12101"
     variation       1107
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:1803131"
     variation       1107
                     /allele="G"
                     /allele="A"
                     /db_xref="dbSNP:6998"
     variation       1211
                     /allele="C"
                     /allele="T"
                     /db_xref="dbSNP:1803136"
     variation       1229
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:1803135"
     variation       1249
                     /allele="T"
                     /allele="C"
                     /db_xref="dbSNP:1803134"
     variation       1273
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:12040"
     variation       1326
                     /allele="T"
                     /allele="C"
                     /db_xref="dbSNP:1803133"
     variation       1344
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:12863"
     variation       1363
                     /allele="A"
                     /allele="G"
                     /db_xref="dbSNP:1803137"
     misc_feature    1445..1450
                     /note="put. polyA signal"
     polyA_site      1466
                     /note="polyA site"
BASE COUNT 338 a 382 c 422 g 324 t 

ORIGIN 

        1 agccccaagc ttaccacctg cacccggaga gctgtgtgtc accatgtggg tcccggttgt
       61 cttcctcacc ctgtccgtga cgtggattgg tgctgcaccc ctcatcctgt ctcggattgt
      121 gggaggctgg gagtgcgaga agcattccca accctggcag gtgcttgtgg cctctcgtgg
      181 cagggcagtc tgcggcggtg ttctggtgca cccccagtgg gtcctcacag ctgcccactg
      241 catcaggaac aaaagcgtga tcttgctggg tcggcacagc ctgtttcatc ctgaagacac
      301 aggccaggta tttcaggtca gccacagctt cccacacccg ctctacgata tgagcctcct
      361 gaagaatcga ttcctcaggc caggtgatga ctccagccac gacctcatgc tgctccgcct
      421 gtcagagcct gccgagctca cggatgctgt gaaggtcatg gacctgccca cccaggagcc
      481 agcactgggg accacctgct acgcctcagg ctggggcagc attgaaccag aggagttctt
      541 gaccccaaag aaacttcagt gtgtggacct ccatgttatt tccaatgacg tgtgtgcgca
      601 agttcaccct cagaaggtga ccaagttcat gctgtgtgct ggacgctgga cagggggcaa
      661 aagcacctgc tcgggtgatt ctgggggccc acttgtctgt aatggtgtgc ttcaaggtat
      721 cacgtcatgg ggcagtgaac catgtgccct gcccgaaagg ccttccctgt acaccaaggt
      781 ggtgcattac cggaagtgga tcaaggacac catcgtggcc aacccctgag cacccctatc
      841 aaccccctat tgtagtaaac ttggaacctt ggaaatgacc aggccaagac tcaagcctcc
      901 ccagttctac tgacctttgt ccttaggtgt gaggtccagg gttgctagga aaagaaatca
      961 gcagacacag gtgtagacca gagtgtttct taaatggtgt aattttgtcc tctctgtgtc
     1021 ctggggaata ctggccatgc ctggagacat atcactcaat ttctctgagg acacagatag
     1081 gatggggtgt ctgtgttatt tgtggggtac agagatgaaa gaggggtggg atccacactg
     1141 agagagtgga gagtgacatg tgctggacac tgtccatgaa gcactgagca gaagctggag
     1201 gcacaacgca ccagacactc acagcaagga tggagctgaa aacataaccc actctgtcct
     1261 ggaggcactg ggaagcctag agaaggctgt gagccaagga gggagggtct tcctttggca
     1321 tgggatgggg atgaagtaag gagagggact ggaccccctg gaagctgatt cactatgggg
     1381 ggaggtgtat tgaagtcctc cagacaaccc tcagatttga tgatttccta gtagaactca
     1441 cagaaataaa gagctgttat actgtg
//



U87459. Human autoimmunog... [gi:1890098]

LOCUS       HSU87459      752 bp    mRNA            PRI       22-DEC-1999
DEFINITION  Human autoimmunogenic cancer/testis antigen NY-ESO-1 mRNA,
            complete cds.
ACCESSION   U87459
VERSION     U87459.1  GI:1890098
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 752)
  AUTHORS   Chen,Y.T., Scanlan,M.J., Sahin,U., Tureci,O., Gure,A.O., Tsang,S.,
            Williamson,B., Stockert,E., Pfreundschuh,M. and Old,L.J.
  TITLE     A testicular antigen aberrantly expressed in human cancers detected
            by autologous antibody screening
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 94 (5), 1914-1918 (1997)
  MEDLINE   97203161
  PUBMED    9050879
REFERENCE   2  (bases 1 to 752)
  AUTHORS   Chen,Y.-T.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-JAN-1997) Ludwig Institute for Cancer Research, New
            York Branch, 1275 York Avenue, New York, NY 10021, USA

FEATURES             Location/Qualifiers
     source          1..752
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone_lib="esophageal squamous cell carcinoma expression
                     cDNA library"
     CDS             54..596
                     /codon_start=1
                     /product="autoimmunogenic cancer/testis antigen NY-ESO-1"
                     /protein_id="AAB49693.1"
                     /db_xref="GI:1890099"
                     /translation="MQAEGRGTGGSTGDADGPGGPGIPDGPGGNAGGPGEAG
                     ATGGRGPRGAGAARASGPGGGAPRGPHGGAASGLNGCCRCGARGPESRLLEF
                     YLAMPFATPMEAELARRSLAQDAPPLPVPGVLLKEFTVSGNILTIRLTAADH
                     RQLQLSISSCLQQLSLLMWITQCFLPVFLAQPPSGQRR"

BASE COUNT 126 a 230 c 256 g 140 t 

ORIGIN 

        1 atcctcgtgg gccctgacct tctctctgag agccgggcag aggctccgga gccatgcagg
       61 ccgaaggccg gggcacaggg ggttcgacgg gcgatgctga tggcccagga ggccctggca
      121 ttcctgatgg cccagggggc aatgctggcg gcccaggaga ggcgggtgcc acgggcggca
      181 gaggtccccg gggcgcaggg gcagcaaggg cctcggggcc gggaggaggc gccccgcggg
      241 gtccgcatgg cggcgcggct tcagggctga atggatgctg cagatgcggg gccagggggc
      301 cggagagccg cctgcttgag ttctacctcg ccatgccttt cgcgacaccc atggaagcag
      361 agctggcccg caggagcctg gcccaggatg ccccaccgct tcccgtgcca ggggtgcttc
      421 tgaaggagtt cactgtgtcc ggcaacatac tgactatccg actgactgct gcagaccacc
      481 gccaactgca gctctccatc agctcctgtc tccagcagct ttccctgttg atgtggatca
      541 cgcagtgctt tctgcccgtg tttttggctc agcctccctc agggcagagg cgctaagccc
      601 agcctggcgc cccttcctag gtcatgcctc ctcccctagg gaatggtccc agcacgagtg
      661 gccagttcat tgtgggggcc tgattgtttg tcgctggagg aggacggctt acatgtttgt
      721 ttctgtagaa aataaaactg agctacgaaa aa
//



CAA11116. LAGE-1a protein [... [gi:3255959]

LOCUS       CAA11116      180 aa                    PRI       23-JUN-1998
DEFINITION  LAGE-1a protein [Homo sapiens].
ACCESSION   CAA11116
PID         g3255959
VERSION     CAA11116.1  GI:3255959
DBSOURCE    embl locus HOS223093, accession AJ223093.1
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (residues 1 to 180)
  AUTHORS   Lethe,B., Lucas,S., Michaux,L., De Smet,C., Godelaine,D.,
            Serrano,A., De Plaen,E. and Boon,T.
  TITLE     LAGE-1, a new gene with tumor specificity
  JOURNAL   Int. J. Cancer 76 (6), 903-908 (1998)
  MEDLINE   98289662
REFERENCE   2  (residues 1 to 180)
  AUTHORS   Lethe,B.G.
  TITLE     Direct Submission
  JOURNAL   Submitted (08-JAN-1998) Lethe B.G., Brussels Branch, Ludwig
            Institute for Cancer Research, 74 Avenue Hippocrate, B - 1200 -
            Bruxelles, BELGIUM
COMMENT     Related sequences: AJ223040, AJ223041 and AJ003149.

FEATURES             Location/Qualifiers
     source          1..180
                     /organism="Homo sapiens"
                     /isolate="individual LB33"
                     /db_xref="taxon:9606"
                     /chromosome="X"
                     /map="q28"
                     /clone="c2RB"
                     /sex="Female"
                     /cell_line="LB33-MELA"
                     /cell_type="melanoma"
                     /clone_lib="B6"
     Protein         1..180
                     /product="LAGE-1a protein"
     CDS             1..180
                     /gene="LAGE-1"
                     /db_xref="SPTREMBL:O75637"
                     /coded_by="join(AJ223093.1:1231..1499,
                     AJ223093.1:2115..2249,AJ223093.1:2479..2617)"
ORIGIN
        1 mqaegrgtgg stgdadgpgg pgipdgpggn aggpgeagat ggrgprgaga arasgprgga
       61 prgphggaas aqdgrcpcga rrpdsrllel hitmpfsspm eaelvrrils rdaaplprpg
      121 avlkdftvsg nllfirltaa dhrqlqlsis sclqqlsllm witqcflpvf laqapsgqrr
//



CAA11117. LAGE-1b protein [... [gi:3255960]

LOCUS       CAA11117      210 aa                    PRI       23-JUN-1998
DEFINITION  LAGE-1b protein [Homo sapiens].
ACCESSION   CAA11117
PID         g3255960
VERSION     CAA11117.1  GI:3255960
DBSOURCE    embl locus HOS223093, accession AJ223093.1
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (residues 1 to 210)
  AUTHORS   Lethe,B., Lucas,S., Michaux,L., De Smet,C., Godelaine,D.,
            Serrano,A., De Plaen,E. and Boon,T.
  TITLE     LAGE-1, a new gene with tumor specificity
  JOURNAL   Int. J. Cancer 76 (6), 903-908 (1998)
  MEDLINE   98289662
REFERENCE   2  (residues 1 to 210)
  AUTHORS   Lethe,B.G.
  TITLE     Direct Submission
  JOURNAL   Submitted (08-JAN-1998) Lethe B.G., Brussels Branch, Ludwig
            Institute for Cancer Research, 74 Avenue Hippocrate, B - 1200 -
            Bruxelles, BELGIUM
COMMENT     Related sequences: AJ223040, AJ223041 and AJ003149.
FEATURES             Location/Qualifiers
     source          1..210
                     /organism="Homo sapiens"
                     /isolate="individual LB33"
                     /db_xref="taxon:9606"
                     /chromosome="X"
                     /map="q28"
                     /clone="c2RB"
                     /sex="Female"
                     /cell_line="LB33-MELA"
                     /cell_type="melanoma"
                     /clone_lib="B6"
     Protein         1..210
                     /product="LAGE-1b protein"
     CDS             1..210
                     /gene="LAGE-1"
                     /db_xref="SPTREMBL:O75638"
                     /coded_by="join(AJ223093.1:1231..1499,
                     AJ223093.1:2115..2478)"
                     /note="alternate splicing at exon 2 - intron 2 boundary"
ORIGIN
        1 mqaegrgtgg stgdadgpgg pgipdgpggn aggpgeagat ggrgprgaga arasgprgga
       61 prgphggaas aqdgrcpcga rrpdsrllel hitmpfsspm eaelvrrils rdaaplprpg
      121 avlkdftvsg nllfmsvwdq dregagrmrv vgwglgsasp egqkardlrt pkhkvseqrp
      181 gtpgppppeg aqgdgcrgva fnvmfsaphi
//

M77481. Human antigen (MA... [gi:416114]

LOCUS       HUMMAG1A     2420 bp    DNA             PRI       15-NOV-1993
DEFINITION  Human antigen (MAGE-1) gene, complete cds.
ACCESSION   M77481
VERSION     M77481.1  GI:416114
KEYWORDS    antigen.
SOURCE      Homo sapiens (individual_isolate patient MZ2) melanoma abdominal
            metastasis of melanoma DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 785 to 1286)
  AUTHORS   van der Bruggen,P., Traversari,C., Chomez,P., Lurquin,C., De
            Plaen,E., Van den Eynde,B., Knuth,A. and Boon,T.
  TITLE     A gene encoding an antigen recognized by cytolytic T lymphocytes on
            a human melanoma
  JOURNAL   Science 254, 1643-1647 (1991)
  MEDLINE   92086861
REFERENCE   2  (bases 1 to 2420)
  AUTHORS   van der Bruggen P.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-FEB-1992) Pierre van der Bruggen, Ludwig Institute
            for Cancer Research, Brussels Branch, Avenue Hippocrate, 74, UCL
            7459, Brussels, B-1200, Belgium
COMMENT     On Nov 15, 1993 this sequence version replaced gi:187294.
FEATURES             Location/Qualifiers
     source          1..2420
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="X"
                     /sex="female"
                     /cell_line="MZ2-MEL.43"
     intron          1..412
                     /partial
                     /gene="MAGE-1"
                     /number=1
     exon            413..485
                     /gene="MAGE-1"
                     /number=2
     mRNA            join(413..485,561..2111)
                     /partial
                     /gene="MAGE-1"
     intron          486..560
                     /gene="MAGE-1"
                     /number=2
     exon            561..2111
                     /gene="MAGE-1"
                     /number=3
     gene            626..1555
                     /gene="MAGE-1"
     CDS             626..1555
                     /gene="MAGE-1"
                     /codon_start=1
                     /product="MAGE-1"
                     /protein_id="AAA03229.1"
                     /db_xref="GI:416115"
                     /translation="MSLEQRSLHCKPEEALEAQQEALGLVCVQAATSSSSPL
                     VLGTLEEVPTAGSTDPPQSPQGASAFPTTINFTRQRQPSEGSSSREEEGPST
                     SCILESLFRAVITKKVADLVGFLLLKYRAREPVTKAEMLESVIKNYKHCFPE
                     IFGKASESLQLVFGIDVKEADPTGHSYVLVTCLGLSYDGLLGDNQIMPKTGF
                     LIIVLVMIAMEGGHAPEEEIWEELSVMEVYDGREHSAYGEPRKLLTQDLVQE
                     KYLEYRQVPDSDPARYEFLWGPRALAETSYVKVLEYVIKVSARVRFFFPSLR
                     EAALREEEEGV"
     polyA_signal    2095..2100
                     /gene="MAGE-1"
     intron          2112..2420
                     /partial
                     /gene="MAGE-1"
                     /number=3

BASE COUNT 562 a 582 c 677 g 599 t 

ORIGIN 

        1 ggatccaggc cctgccagga aaaatataag ggccctgcgt gagaacagag ggggtcatcc
       61 actgcatgag agtggggatg tcacagagtc cagcccaccc tcctggtagc actgagaagc
      121 cagggctgtg cttgcggtct gcaccctgag ggcccgtgga ttcctcttcc tggagctcca
      181 ggaaccaggc agtgaggcct tggtctgaga cagtatcctc aggtcacaga gcagaggatg
      241 cacagggtgt gccagcagtg aatgtttgcc ctgaatgcac accaagggcc ccacctgcca
      301 caggacacat aggactccac agagtctggc ctcacctccc tactgtcagt cctgtagaat
      361 cgacctctgc tggccggctg taccctgagt accctctcac ttcctccttc aggttttcag
      421 gggacaggcc aacccagagg acaggattcc ctggaggcca cagaggagca ccaaggagaa
      481 gatctgtaag taggcctttg ttagagtctc caaggttcag ttctcagctg aggcctctca
      541 cacactccct ctctccccag gcctgtgggt cttcattgcc cagctcctgc ccacactcct
      601 gcctgctgcc ctgacgagag tcatcatgtc tcttgagcag aggagtctgc actgcaagcc
      661 tgaggaagcc cttgaggccc aacaagaggc cctgggcctg gtgtgtgtgc aggctgccac
      721 ctcctcctcc tctcctctgg tcctgggcac cctggaggag gtgcccactg ctgggtcaac
      781 agatcctccc cagagtcctc agggagcctc cgcctttccc actaccatca acttcactcg
      841 acagaggcaa cccagtgagg gttccagcag ccgtgaagag gaggggccaa gcacctcttg
      901 tatcctggag tccttgttcc gagcagtaat cactaagaag gtggctgatt tggttggttt
      961 tctgctcctc aaatatcgag ccagggagcc agtcacaaag gcagaaatgc tggagagtgt
     1021 catcaaaaat tacaagcact gttttcctga gatcttcggc aaagcctctg agtccttgca
     1081 gctggtcttt ggcattgacg tgaaggaagc agaccccacc ggccactcct atgtccttgt
     1141 cacctgccta ggtctctcct atgatggcct gctgggtgat aatcagatca tgcccaagac
     1201 aggcttcctg ataattgtcc tggtcatgat tgcaatggag ggcggccatg ctcctgagga
     1261 ggaaatctgg gaggagctga gtgtgatgga ggtgtatgat gggagggagc acagtgccta
     1321 tggggagccc aggaagctgc tcacccaaga tttggtgcag gaaaagtacc tggagtaccg
     1381 gcaggtgccg gacagtgatc ccgcacgcta tgagttcctg tggggtccaa gggccctcgc
     1441 tgaaaccagc tatgtgaaag tccttgagta tgtgatcaag gtcagtgcaa gagttcgctt
     1501 tttcttccca tccctgcgtg aagcagcttt gagagaggag gaagagggag tctgagcatg
     1561 agttgcagcc aaggccagtg ggagggggac tgggccagtg caccttccag ggccgcgtcc
     1621 agcagcttcc cctgcctcgt gtgacatgag gcccattctt cactctgaag agagcggtca
     1681 gtgttctcag tagtaggttt ctgttctatt gggtgacttg gagatttatc tttgttctct
     1741 tttggaattg ttcaaatgtt tttttttaag ggatggttga atgaacttca gcatccaagt
     1801 ttatgaatga cagcagtcac acagttctgt gtatatagtt taagggtaag agtcttgtgt
     1861 tttattcaga ttgggaaatc cattctattt tgtgaattgg gataataaca gcagtggaat
     1921 aagtacttag aaatgtgaaa aatgagcagt aaaatagatg agataaagaa ctaaagaaat
     1981 taagagatag tcaattcttg ccttatacct cagtctattc tgtaaaattt ttaaagatat
     2041 atgcatacct ggatttcctt ggcttctttg agaatgtaag agaaattaaa tctgaataaa
     2101 gaattcttcc tgttcactgg ctcttttctt ctccatgcac tgagcatctg ctttttggaa
     2161 ggccctgggt tagtagtgga gatgctaagg taagccagac tcatacccac ccatagggtc
     2221 gtagagtcta ggagctgcag tcacgtaatc gaggtggcaa gatgtcctct aaagatgtag
     2281 ggaaaagtga gagaggggtg agggtgtggg gctccgggtg agagtggtgg agtgtcaatg
     2341 ccctgagctg gggcattttg ggctttggga aactgcagtt ccttctgggg gagctgattg
     2401 taatgatctt gggtggatcc
//



L18920. Human MAGE-2 gene ... [gi:436180]

LOCUS       HUMMAGE2X    4559 bp    DNA             PRI       20-APR-1994
DEFINITION  Human MAGE-2 gene exons 1-4, complete cds.
ACCESSION   L18920
VERSION     L18920.1  GI:436180
KEYWORDS
SOURCE      Homo sapiens (human).
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4559)
  AUTHORS   De Smet,C., Lurquin,C., van der Bruggen,P., De Plaen,E.,
            Brasseur,F. and Boon,T.
  TITLE     Sequence and expression pattern of the human MAGE2 gene
  JOURNAL   Immunogenetics 39 (2), 121-129 (1994)
  MEDLINE   94102805
FEATURES             Location/Qualifiers
     source          1..4559
                     /organism="Homo sapiens"
                     /isolate="MZ-2"
                     /db_xref="taxon:9606"
                     /map="X"
                     /haplotype="A1 A29 B37 B44 C6"
                     /sex="female"
                     /cell_line="MZ2-MEL.43"
                     /cell_type="melanoma cell line"
     gene            561..4496
                     /gene="MAGE-2"
     exon            561..754
                     /gene="MAGE-2"
                     /number=1
     intron          755..1850
                     /gene="MAGE-2"
                     /number=1
     exon            1851..1969
                     /gene="MAGE-2"
                     /number=2
     intron          1970..2788
                     /gene="MAGE-2"
                     /number=2
     exon            2789..2854
                     /gene="MAGE-2"
                     /number=3
     intron          2855..2934
                     /gene="MAGE-2"
                     /number=3
     exon            2935..4496
                     /gene="MAGE-2"
                     /number=4
     CDS             3000..3944
                     /gene="MAGE-2"
                     /codon_start=1
                     /protein_id="AAA17729.1"
                     /db_xref="GI:436181"
                     /translation="MPLEQRSQHCKPEEGLEARGEALGLVGAQAPATEEQQTASSSSTLVEVT
                     LGEVPAADSPSPPHSPQGASSFSTTINYTLWRQSDEGSSNQEEEGPRMFPDLE
                     SEFQAAISRKMVELVHFLLLKYRAREPVTKAEMLESVLRNCQDFFPVIFSKASEYLQLVFGIE
                     VVEVVPISHLYILVTCLGLSYDGLLGDNQVMPKTGLLIIVLAIIAIEGDCAPEEKIWEELSML
                     EVFEGREDSVFAHPRKLLMQDLVQENYLEYRQVPGSDPACYEFLWGPRALIETSYVKVLHHTL
                     KIGGEPHISYPPLHERALREGEE"
     polyA_signal    4479..4484
                     /gene="MAGE-2"
BASE COUNT     1049 a   1280 c   1283 g    947 t
ORIGIN
        1 attccttcat caaacagcca ggagtgagga agaggaccct cctgagtgag gactgaggat
       61 ccaccctcac cacatagtgg gaccacagaa tccagctcag cccctcttgt cagccctggt
      121 acacactggc aatgatctca ccccgagcac acccctcccc ccaatgccac ttcgggccga
      181 ctcagagtca gagacttggt ctgaggggag cagacacaat cggcagagga tggcggtcca
      241 ggctcagtct ggcatccaag tcaggacctt gagggatgac caaaggcccc tcccaccccc
      301 aactcccccg accccaccag gatctacagc ctcaggatcc ccgtcccaat ccctacccct
      361 acaccaacac catcttcatg cttaccccca cccccccatc cagatcccca tccgggcaga
      421 atccggttcc acccttgccg tgaacccagg gaagtcacgg gcccggatgt gacgccactg
      481 acttgcacat tggaggtcag aggacagcga gattctcgcc ctgagcaacg gcctgacgtc
      541 ggcggaggga agcaggcgca ggctccgtga ggaggcaagg taagacgccg agggaggact
      601 gaggcgggcc tcaccccaga cagagggccc ccaataatcc agcgctgcct ctgctgccgg
      661 gcctggacca ccctgcaggg gaagacttct caggctcagt cgccaccacc tcaccccgcc
      721 accccccgcc gctttaaccg cagggaactc tggcgtaaga gctttgtgtg accagggcag
      781 ggctggttag aagtgctcag ggcccagact cagccaggaa tcaaggtcag gaccccaaga
      841 ggggactgag ggcaacccac cccctaccct cactaccaat cccatccccc aacaccaacc
      901 ccacccccat ccctcaaaca ccaaccccac ccccaaaccc cattcccatc tcctccccca
      961 ccaccatcct ggcagaatcc ggctttgccc ctgcaatcaa cccacggaag ctccgggaat
     1021 ggcggccaag cacgcggatc ctgacgttca catgtacggc taagggaggg aaggggttgg
     1081 gtctcgtgag tatggccttt gggatgcaga ggaagggccc aggcctcctg gaagacagtg
     1141 gagtccttag gggacccagc atgccaggac agggggccca ctgtacccct gtctcaaact
     1201 gagccacctt ttcattcagc cgagggaatc ctagggatgc agacccactt cagcaggggg
     1261 ttggggccca gcctgcgagg agtcaagggg aggaagaaga gggaggactg aggggacctt
     1321 ggagtccaga tcagtggcaa ccttgggctg ggggatcctg ggcacagtgg ccgaatgtgc
     1381 cccgtgctca ttgcaccttc agggtgacag agagttgagg gctgtggtct gagggctggg
     1441 acttcaggtc agcagaggga ggaatcccag gatctgccgg acccaaggtg tgcccccttc
     1501 atgaggactg gggatacccc cggcccagaa agaagggatg ccacagagtc tggaagtccc
     1561 ttgttcttag ctctggggga acctgatcag ggatggccct aagtgacaat ctcatttgta
     1621 ccacaggcag gaggttgggg aaccctcagg gagataaggt gttggtgtaa agaggagctg
     1681 tctgctcatt tcagggggtt gggggttgag aaagggcagt ccctggcagg agtaaagatg
     1741 agtaacccac aggaggccat cataacgttc accctagaac caaaggggtc agccctggac
     1801 aacgcacgtg ggggtaacag gatgtggccc ctcctcactt gtctttccag atctcaggga
     1861 gttgatgacc ttgttttcag aaggtgactc aggtcaacac aggggcccca tctggtcgac
     1921 agatgcagtg gttctaggat ctgccaagca tccaggtgga gagcctgagg taggattgag
     1981 ggtacccctg ggccagaatg cagcaagggg gccccataga aatctgccct gcccctgcgg
     2041 ttacttcaga gaccctgggc agggctgtca gctgaagtcc ctccattatc ctgggatctt
     2101 tgatgtcagg gaaggggagg ccttggtctg aaggggctgg agtcaggtca gtagagggag
     2161 ggtctcaggc cctgccagga gtggacgtga ggaccaagcg gactcgtcac ccaggacacc
     2221 tggactccaa tgaatttgga catctctcgt tgtccttcgc gggaggacct ggtcacgtat
     2281 ggccagatgt gggtcccctc atatccttct gtaccatatc agggatgtga gttcttgaca
     2341 tgagagattc tcaagccagc aaaagggtgg gattaggccc tacaaggaga aaggtgaggg
     2401 ccctgagtga gcacagaggg gaccctccac ccaagtagag tggggacctc acggagtctg
     2461 gccaaccctg ctgagacttc tgggaatccg tggctgtgct tgcagtctgc acactgaagg
     2521 cccgtgcatt cctctcccag gaatcaggag ctccaggaac caggcagtga ggccttggtc
     2581 tgagtcagtg tcctcaggtc acagagcaga ggggacgcag acagtgccaa cactgaaggt
     2641 ttgcctggaa tgcacaccaa gggccccacc cgcccagaac aaatgggact ccagagggcc
     2701 tggcctcacc ctccctattc tcagtcctgc agcctgagca tgtgctggcc ggctgtaccc
     2761 tgaggtgccc tcccacttcc tccttcaggt tctgaggggg acaggctgac aagtaggacc
     2821 cgaggcactg gaggagcatt gaaggagaag atctgtaagt aagcctttgt cagagcctcc
     2881 aaggttcagt tcagttctca cctaaggcct cacacacgct ccttctctcc ccaggcctgt
     2941 gggtcttcat tgcccagctc ctgcccgcac tcctgcctgc tgccctgacc agagtcatca
     3001 tgcctcttga gcagaggagt cagcactgca agcctgaaga aggccttgag gcccgaggag
     3061 aggccctggg cctggtgggt gcgcaggctc ctgctactga ggagcagcag accgcttctt
     3121 cctcttctac tctagtggaa gttaccctgg gggaggtgcc tgctgccgac tcaccgagtc
     3181 ctccccacag tcctcaggga gcctccagct tctcgactac catcaactac actctttgga
     3241 gacaatccga tgagggctcc agcaaccaag aagaggaggg gccaagaatg tttcccgacc
     3301 tggagtccga gttccaagca gcaatcagta ggaagatggt tgagttggtt cattttctgc
     3361 tcctcaagta tcgagccagg gagccggtca caaaggcaga aatgctggag agtgtcctca
     3421 gaaattgcca ggacttcttt cccgtgatct tcagcaaagc ctccgagtac ttgcagctgg
     3481 tctttggcat cgaggtggtg gaagtggtcc ccatcagcca cttgtacatc cttgtcacct
     3541 gcctgggcct ctcctacgat ggcctgctgg gcgacaatca ggtcatgccc aagacaggcc
     3601 tcctgataat cgtcctggcc ataatcgcaa tagagggcga ctgtgcccct gaggagaaaa
     3661 tctgggagga gctgagtatg ttggaggtgt ttgaggggag ggaggacagt gtcttcgcac
     3721 atcccaggaa gctgctcatg caagatctgg tgcaggaaaa ctacctggag taccggcagg
     3781 tgcccggcag tgatcctgca tgctacgagt tcctgtgggg tccaagggcc ctcattgaaa
     3841 ccagctatgt gaaagtcctg caccatacac taaagatcgg tggagaacct cacatttcct
     3901 acccacccct gcatgaacgg gctttgagag agggagaaga gtgagtctca gcacatgttg
     3961 cagccagggc cagtgggagg gggtctgggc cagtgcacct tccagggccc catccattag
     4021 cttccactgc ctcgtgtgat atgaggccca ttcctgcctc tttgaagaga gcagtcagca
     4081 ttcttagcag tgagtttctg ttctgttgga tgactttgag atttatcttt ctttcctgtt
     4141 ggaattgttc aaatgttcct tttaacaaat ggttggatga acttcagcat ccaagtttat
     4201 gaatgacagt agtcacacat agtgctgttt atatagttta ggggtaagag tcctgttttt
     4261 tattcagatt gggaaatcca ttccattttg tgagttgtca cataataaca gcagtggaat
     4321 atgtatttgc ctatattgtg aacgaattag cagtaaaata catgatacaa ggaactcaaa
     4381 agatagttaa ttcttgcctt atacctcagt ctattatgta aaattaaaaa tatgtgtatg
     4441 tttttgcttc tttgagaatg caaaagaaat taaatctgaa taaattcttc ctgttcactg
     4501 gctcatttct ttaccattca ctcagcatct gctctgtgga aggccctggt agtagtggg
//


U03735. Human MAGE-3 anti... [gi:468825]

LOCUS       HSU03735     4204 bp    DNA             PRI       07-APR-1994
DEFINITION  Human MAGE-3 antigen (MAGE-3) gene, complete cds.
ACCESSION   U03735
VERSION     U03735.1  GI:468825
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4204)
  AUTHORS   Gaugler,B., Van den Eynde,B., van der Bruggen,P., Romero,P.,
            Gaforio,J.J., De Plaen,E., Lethe,B., Brasseur,F. and Boon,T.
  TITLE     Human gene MAGE-3 codes for an antigen recognized on a melanoma by
            autologous cytolytic T lymphocytes
  JOURNAL   J. Exp. Med. 179, 921-930 (1994)
  MEDLINE   94157413
REFERENCE   2  (bases 1 to 4204)
  AUTHORS   Gaugler,B.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-NOV-1993) Beatrice Gaugler, Ludwig Institute for
            Cancer Research, 74 Avenue Hippocrate, Brussels 1200, Belgium
FEATURES             Location/Qualifiers
     source          1..4204
                     /organism="Homo sapiens"
                     /isolate="patient MZ2"
                     /db_xref="taxon:9606"
                     /chromosome="X"
                     /sex="female"
                     /cell_type="lymphocyte"
                     /tissue_type="blood"
                     /dev_stage="adult"
     exon            434..468
                     /number=1
     exon            2254..2319
                     /number=2
     exon            2400..3978
                     /number=3
     gene            2465..3409
                     /gene="MAGE-3"
     CDS             2465..3409
                     /gene="MAGE-3"
                     /codon_start=1
                     /product="MAGE-3 antigen"
                     /protein_id="AAA17446.1"
                     /db_xref="GI:468826"
                     /translation="MPLEQRSQHCKPEEGLEARGEALGLVGAQAPATEEQEAASSSSTLVEVTLGE
                     VPAAESPDPPQSPQGASSLPTTMNYPLWSQSYEDSSNQEEEGPSTFPDLESEFQAALSRKVAELVH
                     FLLLKYRAREPVTKAEMLGSVVGNWQYFFPVIFSKASSSLQLVFGIELMEVDPIGHLYIFATCLGL
                     SYDGLLGDNQIMPKAGLLIIVLAIIAREGDCAPEEKIWEELSVLEVFEGREDSILGDPKKLLTQHF
                     VQENYLEYRQVPGSDPACYEFLWGPRALVETSYVKVLHHMVKISGGPHISYPPLHEWVLREGEE"
     polyA_signal    3958..3963
     polyA_site      3979
BASE COUNT      944 a   1144 c   1223 g    893 t

ORIGIN 

        1 acgcaggcag tgatgtcacc cagaccacac cccttccccc aatgccactt cagggggtac
       61 tcagagtcag agacttggtc tgaggggagc agaagcaatc tgcagaggat ggcggtccag
      121 gctcagccag gcatcaactt caggaccctg agggatgacc gaaggccccg cccacccacc
      181 cccaactccc ccgaccccac caggatctac agcctcagga cccccgtccc aatccttacc
      241 ccttgcccca tcaccatctt catgcttacc tccaccccca tccgatcccc atccaggcag
      301 aatccagttc cacccctgcc cggaacccag ggtagtaccg ttgccaggat gtgacgccac
      361 tgacttgcgc attggaggtc agaagaccgc gagattctcg ccctgagcaa cgagcgacgg
      421 cctgacgtcg gcggagggaa gccggcccag gctcggtgag gaggcaaggt aagacgctga
      481 gggaggactg aggcgggcct cacctcagac agagggcctc aaataatcca gtgctgcctc
      541 tgctgccggg cctgggccac cccgcagggg aagacttcca ggctgggtcg ccactacctc
      601 accccgccga cccccgccgc tttagccacg gggaactctg gggacagagc ttaatgtggc
      661 cagggcaggg ctggttagaa gaggtcaggg cccacgctgt ggcaggaatc aaggtcagga
      721 ccccgagagg gaactgaggg cagcctaacc accaccctca ccaccattcc cgtcccccaa
      781 cacccaaccc cacccccatc ccccattccc atccccaccc ccacccctat cctggcagaa
      841 tccgggcttt gcccctggta tcaagtcacg gaagctccgg gaatggcggc caggcacgtg
      901 agtcctgagg ttcacatcta cggctaaggg agggaagggg ttcggtatcg cgagtatggc
      961 cgttgggagg cagcgaaagg gcccaggcct cctggaagac agtggagtcc tgaggggacc
     1021 cagcatgcca ggacaggggg cccactgtac ccctgtctca aaccgaggca ccttttcatt
     1081 cggctacggg aatcctaggg atgcagaccc acttcagcag ggggttgggg cccagccctg
     1141 cgaggagtca tggggaggaa gaagagggag gactgagggg accttggagt ccagatcagt
     1201 ggcaaccttg ggctggggga tgctgggcac agtggccaaa tgtgctctgt gctcattgcg
1261 ccttcagggt gaccagagag ttgagggctg tggtctgaag agtgggactt
caggtcagca

1321 gagggaggaa tcccaggatc tgcagggccc aaggtgtacc cccaaggggc
ccctatgtgg

1381 tggacagatg cagtggtcct aggatctgcc aagcatccag gtgaagagac
tgagggagga

1441 ttgagggtac ccctgggaca gaatgcggac tgggggcccc ataaaaatct
gccctgctcc

1501 tgctgttacc tcagagagcc tgggcagggc tgtcagctga ggtccctcca
ttatcctagg

1561 atcactgatg tcagggaagg ggaagccttg gtctgagggg gctgcactca
gggcagtaga

1621 gggaggctct cagaccctac taggagtgga ggtgaggacc aagcagtctc
ctcacccagg

1681 gtacatggac ttcaataaat ttggacatct ctcgttgtcc tttccgggag
gacctgggaa

1741 tgtatggcca gatgtgggtc ccctcatgtt tttctgtacc atatcaggta
tgtgagttct

1801 tgacatgaga gattctcagg ccagcagaag ggagggatta ggccctataa
ggagaaaggt

1861 gagggccctg agtgagcaca gaggggatcc tccaccccag tagagtgggg
acctcacaga

1921 gtctggccaa ccctcctgac agttctggga atccgtggct gcgtttgctg
tctgcacatt

1981 gggggcccgt ggattcctct cccaggaatc aggagctcca ggaacaaggc
agtgaggact

2041 tggtctgagg cagtgtcctc aggtcacaga gtagaggggg ctcagatagt
gccaacggtg

2101 aaggtttgcc ttggattcaa accaagggcc ccacctgccc cagaacacat
ggactccaga

2161 gcgcctggcc tcaccctcaa tactttcagt cctgcagcct cagcatgcgc
tggccggatg

2221 taccctgagg tgccctctca cttcctcctt caggttctga ggggacaggc
tgacctggag

2281 gaccagaggc ccccggagga gcactgaagg agaagatctg taagtaagcc
tttgttagag

2341 cctccaaggt tccattcagt actcagctga ggtctctcac atgctccctc
tctccccagg

2401 ccagtgggtc tccattgccc agctcctgcc cacactcccg cctgttgccc
tgaccagagt

2461 catcatgcct cttgagcaga ggagtcagca ctgcaagcct gaagaaggcc
ttgaggcccg

2521 aggagaggcc ctgggcctgg tgggtgcgca ggctcctgct actgaggagc
aggaggctgc

2581 ctcctcctct tctactctag ttgaagtcac cctgggggag gtgcctgctg
ccgagtcacc

2641 agatcctccc cagagtcctc agggagcctc cagcctcccc actaccatga
actaccctct

2701 ctggagccaa tcctatgagg actccagcaa ccaagaagag gaggggccaa
gcaccttccc

2761 tgacctggag tccgagttcc aagcagcact cagtaggaag gtggccgagt
tggttcattt

2821 tctgctcctc aagtatcgag ccagggagcc ggtcacaaag gcagaaatgc
tggggagtgt

2881 cgtcggaaat tggcagtatt tctttcctgt gatcttcagc aaagcttcca
gttccttgca

2941 gctggtcttt ggcatcgagc tgatggaagt ggaccccatc ggccacttgt
acatctttgc

3001 cacctgcctg ggcctctcct acgatggcct gctgggtgac aatcagatca
tgcccaaggc 

3061 aggcctcctg ataatcgtcc tggccataat cgcaagagag ggcgactgtg 
cccctgagga 

3121 gaaaatctgg gaggagctga gtgtgttaga ggtgtttgag gggagggaag

acagtatctt 

3181 gggggatccc aagaagctgc tcacccaaca tttcgtgcag gaaaactacc 
tggagtaccg 

3241 gcaggtcccc ggcagtgatc ctgcatgtta tgaattcctg tggggtccaa 
gggccctcgt

3301 tgaaaccagc tatgtgaaag tcctgcacca tatggtaaag atcagtggag 
gacctcacat 

3361 ttcctaccca cccctgcatg agtgggtttt gagagagggg gaagagtgag 
tctgagcacg 

3421 agttgcagcc agggccagtg ggagggggtc tgggccagtg caccttccgg

ggccgcatcc 

3481 cttagtttcc actgcctcct gtgacgtgag gcccattctt cactctttga 
agcgagcagt 

3541 cagcattctt agtagtgggt ttctgttctg ttggatgact ttgagattat 
tctttgtttc

3601 ctgttggagt tgttcaaatg ttccttttaa cggatggttg aatgagcgtc
agcatccagg 

3661 tttatgaatg acagtagtca cacatagtgc tgtttatata gtttaggagt 
aagagtcttg 

3721 ttttttactc aaattgggaa atccattcca ttttgtgaat tgtgacataa

taatagcagt 

3781 ggtaaaagta tttgcttaaa attgtgagcg aattagcaat aacatacatg 
agataactca 

3841 agaaatcaaa agatagttga ttcttgcctt gtacctcaat ctattctgta 
aaattaaaca

3901 aatatgcaaa ccaggatttc cttgacttct ttgagaatgc aagcgaaatt 
aaatctgaat 

3961 aaataattct tcctcttcac tggctcgttt cttttccgtt cactcagcat 
ctgctctgtg 

4021 ggaggccctg ggttagtagt ggggatgcta aggtaagcca gactcacgcc

tacccatagg 

4081 gctgtagagc ctaggacctg cagtcatata attaaggtgg tgagaagtcc 
tgtaagatgt 

4141 agaggaaatg taagagaggg gtgagggtgt ggcgctccgg gtgagagtag 
tggagtgtca

4201 gtgc 

// 

AF043498. Homo sapiens pros ... [gi : 2909843] 
LOCUS AF043498 990 bp mRNA PRI 24-FEB-1998

DEFINITION Homo sapiens prostate stem cell antigen (PSCA) 
mRNA, complete cds. 
ACCESSION AF043498 
VERSION AF043498.1 GI:2909843

KEYWORDS 

SOURCE human. 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi;

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 

Homo. 

REFERENCE 1 (bases 1 to 990) 
AUTHORS Reiter,R.E., Gu,Z., Watabe,T., Thomas,G., Kinga,S., Davis,E.,
Wahl,M., Nisitani,S., Yamashiro,J., Le Beau,M.M., Losa,M. and
Witte,O.N.

TITLE Prostate stem cell antigen: a cell surface marker 

overexpressed in 

prostate cancer 

JOURNAL Proc. Natl. Acad. Sci. U.S.A. 95 (4), 1735-1740 (1998)
MEDLINE 98132661

REFERENCE 2 (bases 1 to 990) 
AUTHORS Reiter,R.E. 
TITLE Direct Submission 

JOURNAL Submitted (19-JAN-1998) Urology, UCLA, 66-134 CHS 10833 Le Conte
Ave., Los Angeles, CA 90095, USA
FEATURES Location/Qualifiers

source 1..990
/organism="Homo sapiens"
/db_xref="taxon:9606"
/chromosome="8"
/map="8q24.2"
/note="LAPC-4 prostate cancer xenograft"
gene 1..990
/gene="PSCA"
CDS 18..389
/gene="PSCA"
/note="GPI-anchored cell surface protein"
/codon_start=1
/product="prostate stem cell antigen"
/protein_id="AAC39607.1"
/db_xref="GI:2909844"
/translation="MKAVLLALLMAGLALQPGTALLCYSCKAQVSNEDCLQVENCTQLGEQCW
TARIRAVGLLTVISKGCSLNCVDDSQDYYVGKKNITCCDTDLCNASGAHALQPAAAILALLPA
LGLLLWGPGQL"

BASE COUNT 193 a 299 c 285 g 202 t 11 others 

ORIGIN 

1 agggagaggc agtgaccatg aaggctgtgc tgcttgccct gttgatggca 
ggcttggccc

61 tgcagccagg cactgccctg ctgtgctact cctgcaaagc ccaggtgagc 
aacgaggact 

121 gcctgcaggt ggagaactgc acccagctgg gggagcagtg ctggaccgcg 
cgcatccgcg 

181 cagttggcct cctgaccgtc atcagcaaag gctgcagctt gaactgcgtg

gatgactcac 

241 aggactacta cgtgggcaag aagaacatca cgtgctgtga caccgacttg 
tgcaacgcca 

301 gcggggccca tgccctgcag ccggctgccg ccatccttgc gctgctccct 
gcactcggcc

361 tgctgctctg gggacccggc cagctatagg ctctgggggg ccccgctgca 
gcccacactg 

421 ggtgtggtgc cccaggcctt tgtgccactc ctcacagaac ctggcccagt 
gggagcctgt 

481 cctggttcct gaggcacatc ctaacgcaag tttgaccatg tatgtttgca

ccccttttcc 

541 ccnaaccctg accttcccat gggccttttc caggattccn accnggcaga 
tcagttttag 
601 tganacanat ccgcntgcag atggcccctc caaccntttn tgttgntgtt
tccatggccc

661 agcattttcc acccttaacc ctgtgttcag gcacttnttc ccccaggaag 
ccttccctgc 

721 ccaccccatt tatgaattga gccaggtttg gtccgtggtg tcccccgcac

ccagcagggg 

781 acaggcaatc aggagggccc agtaaaggct gagatgaagt ggactgagta 
gaactggagg 

841 acaagagttg acgtgagttc ctgggagttt ccagagatgg ggcctggagg 

cctggaggaa

901 ggggccaggc ctcacatttg tggggntccc gaatggcagc ctgagcacag 

cgtaggccct 

961 taataaacac ctgttggata agccaaaaaa 

// 


P06870. GLANDULAR KALLIKR. . . [gi: 125170] 

LOCUS KLK1_HUMAN 262 aa PRI 20-AUG-2001

DEFINITION GLANDULAR KALLIKREIN 1 PRECURSOR (TISSUE KALLIKREIN)
(KIDNEY/PANCREAS/SALIVARY GLAND KALLIKREIN).
ACCESSION P06870
PID g125170
VERSION P06870 GI:125170
DBSOURCE swissprot: locus KLK1_HUMAN, accession P06870;
class: standard.
extra accessions:Q9UMJ1, created: Jan 1, 1988.
sequence updated: Jan 1, 1988.
annotation updated: Aug 20, 2001.
xrefs: gi: gi: 186652, gi: gi: 186653, gi: gi: 186649, gi: gi:
186651, gi: gi: 186645, gi: gi: 186646, gi: gi: 186647, gi: gi:
186648, gi: gi: 34026, gi: gi: 34027, gi: gi: 186643, gi: gi:
386843, gi: gi: 67558
xrefs (non-sequence databases): HSSP P00757, MEROPS S01.160,
GlycoSuiteDB P06870, MIM 147910, InterPro IPR001314, InterPro
IPR001254, Pfam PF00089, PRINTS PR00722, PROSITE PS50240, PROSITE
PS00134, PROSITE PS00135
KEYWORDS Hydrolase; Serine protease; Glycoprotein; Multigene family;
Zymogen; Signal.
SOURCE human.
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

REFERENCE 1 (residues 1 to 262) 

AUTHORS Fukushima,D., Kitamura,N. and Nakanishi,S.
TITLE Nucleotide sequence of cloned cDNA for human

pancreatic kallikrein 

JOURNAL Biochemistry 24, 8037-8043 (1985) 
REMARK SEQUENCE FROM N.A. 
TISSUE=Pancreas
REFERENCE 2 (residues 1 to 262)
AUTHORS Evans,B.A., Yun,Z.X., Close,J.A., Tregear,G.W., Kitamura,N.,
Nakanishi,S., Callen,D.F., Baker,E., Hyland,V.J., Sutherland,G.R.
and Richards,R.I.
TITLE Structure and chromosomal localization of the human renal
kallikrein gene
JOURNAL Biochemistry. 27 (9), 3124-3129 (1988)
MEDLINE 88269498
PUBMED 2898948
REMARK SEQUENCE FROM N.A.
TISSUE=Kidney
REFERENCE 3 (residues 1 to 262)
AUTHORS Angermann,A., Bergmann,C. and Appelhans,H.
TITLE Cloning and expression of human salivary-gland kallikrein in
Escherichia coli
JOURNAL The Biochemical journal. 262 (3), 787-793 (1989)
MEDLINE 90073574
PUBMED 2686621
REMARK SEQUENCE FROM N.A.
TISSUE=Salivary gland
REFERENCE 4 (residues 1 to 262)
AUTHORS Baker,A.R. and Shine,J.
TITLE Human kidney kallikrein: cDNA cloning and sequence analysis
JOURNAL DNA (Mary Ann Liebert, Inc.) 4 (6), 445-450 (1985)
MEDLINE 86135264
PUBMED 3853975
REMARK SEQUENCE OF 17-262 FROM N.A.
TISSUE=Kidney
REFERENCE 5 (residues 1 to 262)
AUTHORS Lu,H.S., Lin,F.K., Chao,L. and Chao,J.
TITLE Human urinary kallikrein. Complete amino acid sequence and sites of
glycosylation
JOURNAL International journal of peptide and protein research. 33 (4),
237-249 (1989)
MEDLINE 89326688
PUBMED 2666327
REMARK SEQUENCE OF 25-262.
TISSUE=Urine
REFERENCE 6 (residues 1 to 262)
AUTHORS Kellermann,J., Lottspeich,F., Geiger,R. and Deutzmann,R.
TITLE Human urinary kallikrein--amino acid sequence and carbohydrate
attachment sites
JOURNAL Protein sequences & data analysis. 1 (3), 177-182 (1988)
MEDLINE 88203586
PUBMED 3163150
REMARK SEQUENCE OF 25-262, AND CARBOHYDRATE-LINKAGE SITES.
TISSUE=Urine
REFERENCE 7 (residues 1 to 262)
AUTHORS Lottspeich,F., Geiger,R., Henschen,A. and Kutzbach,C.
TITLE N-Terminal amino acid sequence of human urinary kallikrein homology
with other serine proteases
JOURNAL Hoppe-Seyler's Zeitschrift fur physiologische Chemie. 360 (12),
1947-1950 (1979)
MEDLINE 80114126
PUBMED 393608
REMARK SEQUENCE OF 25-55.
TISSUE=Urine
REFERENCE 8 (residues 1 to 262)
AUTHORS Takahashi,S., Irie,A., Katayama,Y., Ito,K. and Miyake,Y.
TITLE N-terminal amino acid sequence of human urinary prokallikrein
JOURNAL Journal of biochemistry. 99 (3), 989-992 (1986)
MEDLINE 86223893
PUBMED 3635530
REMARK SEQUENCE OF 28-47.
TISSUE=Urine

[FUNCTION] GLANDULAR KALLIKREINS CLEAVE MET-LYS AND ARG-SER BONDS
IN KININOGEN TO RELEASE LYS-BRADYKININ.
[CATALYTIC ACTIVITY] PREFERENTIAL CLEAVAGE OF ARG-|-XAA BONDS IN
SMALL MOLECULE SUBSTRATES. HIGHLY SELECTIVE ACTION TO RELEASE
KALLIDIN (LYSYL-BRADYKININ) FROM KININOGEN INVOLVES HYDROLYSIS OF
MET-|-XAA OR LEU-|-XAA.
[SIMILARITY] BELONGS TO PEPTIDASE FAMILY S1; ALSO KNOWN AS THE
TRYPSIN FAMILY. KALLIKREIN SUBFAMILY.



FEATURES Location/Qualifiers
source 1..262
/organism="Homo sapiens"
/db_xref="taxon:9606"
Protein 1..262
/product="GLANDULAR KALLIKREIN 1 PRECURSOR"
/EC_number="3.4.21.35"
Region 1..18
/region_name="Signal"
/note="PROBABLE."
Region 19..24
/region_name="Propeptide"
/note="ACTIVATION PEPTIDE (PROBABLE)."
Region 25..262
/region_name="Mature chain"
/note="GLANDULAR KALLIKREIN 1."
Bond bond(31,174)
/bond_type="disulfide"
/note="BY SIMILARITY."
Bond bond(50,66)
/bond_type="disulfide"
/note="BY SIMILARITY."
Site 65
/site_type="active"
/note="CHARGE RELAY SYSTEM."
Site 93
/site_type="glycosylation"
/note="O-LINKED."
Site 102
/site_type="glycosylation"
/note="N-LINKED (GLCNAC...)."
Site 104
/site_type="glycosylation"
/note="O-LINKED."
Site 108
/site_type="glycosylation"
/note="N-LINKED (GLCNAC...)."
Site 120
/site_type="active"
/note="CHARGE RELAY SYSTEM."
Region 145
/region_name="Variant"
/note="Q -> E. /FTId=VAR_006625."
Bond bond(153,220)
/bond_type="disulfide"
/note="BY SIMILARITY."
Site 165
/site_type="glycosylation"
/note="N-LINKED (GLCNAC...) (PARTIAL)."
Site 167
/site_type="glycosylation"
/note="O-LINKED."
Bond bond(185,199)
/bond_type="disulfide"
/note="BY SIMILARITY."
Region 186
/region_name="Variant"
/note="E -> K. /FTId=VAR_006626."
Bond bond(210,235)
/bond_type="disulfide"
/note="BY SIMILARITY."
Site 214
/site_type="active"
/note="CHARGE RELAY SYSTEM."



ORIGIN

1 mwflvlclal slggtgaapp iqsrivggwe ceqhaqpwqa alyntsttqc
ggilvhrqwv

61 ltaahcisdn yqlwlgrhnl fddentaqfv hvsesfphpg fnmsllenht
rqadedyshd

121 lmllrltepa dtitdavkvv elptqepevg stclasgwgs lepenfsfpd
dlqcvdlkil

181 pndecekahv qkvtdfmlcv ghleggkdtc vgdsggplmc dgvlqgvtsw
gyvpcgtpnk

241 psvavrvlsy vkwiedtiae ns

// 



P08217. ELASTASE 2A PRECU. [gi : 119255] 

LOCUS EL2A_HUMAN 269 aa PRI AUG-2001






WO 2003/008537 



PCT/US2002/010189 



DEFINITION ELASTASE 2A PRECURSOR.
ACCESSION P08217
PID g119255
VERSION P08217 GI:119255
DBSOURCE swissprot: locus EL2A_HUMAN, accession P08217;
class: standard.
created: Aug 1, 1988.
sequence updated: Aug 1, 1988.
annotation updated: Aug 20, 2001.
xrefs: gi: gi: 182022, gi: gi: 182023, gi: gi: 182057, gi: gi:
182058, gi: gi: 88298, gi: gi: 88299
xrefs (non-sequence databases): MEROPS S01.155, InterPro IPR001314,
InterPro IPR001254, Pfam PF00089, PRINTS PR00722, PROSITE PS50240,
PROSITE PS00134, PROSITE PS00135
KEYWORDS Hydrolase; Serine protease; Pancreas; Zymogen; Signal. 
SOURCE human. 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;

Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 

Homo. 

REFERENCE 1 (residues 1 to 269) 

AUTHORS Kawashima,I., Tani,T., Shimoda,K. and Takiguchi,Y.
TITLE Characterization of pancreatic elastase II cDNAs: two elastase II
mRNAs are expressed in human pancreas
JOURNAL DNA 6 (2), 163-172 (1987)
MEDLINE 87217962
REMARK SEQUENCE FROM N.A.
REFERENCE 2 (residues 1 to 269)
AUTHORS Fletcher,T.S., Shen,W.F. and Largman,C.
TITLE Primary structure of human pancreatic elastase 2 determined by
sequence analysis of the cloned mRNA
JOURNAL Biochemistry 26 (23), 7256-7261 (1987)
MEDLINE 88107669
REMARK SEQUENCE FROM N.A.
[FUNCTION] ACTS UPON ELASTIN.
[CATALYTIC ACTIVITY] PREFERENTIAL CLEAVAGE: LEU-|-XAA, MET-|-XAA
AND PHE-|-XAA. HYDROLYSES ELASTIN.
[SUBCELLULAR LOCATION] SECRETED.
[TISSUE SPECIFICITY] PANCREAS.
[SIMILARITY] BELONGS TO PEPTIDASE FAMILY S1; ALSO KNOWN AS THE
TRYPSIN FAMILY. ELASTASE SUBFAMILY.
FEATURES Location/Qualifiers
source 1..269
/organism="Homo sapiens"
/db_xref="taxon:9606"
Region 1..16
/region_name="Signal"
Protein 1..269
/product="ELASTASE 2A PRECURSOR"
/EC_number="3.4.21.71"
Region 17..28
/region_name="Propeptide"
/note="ACTIVATION PEPTIDE."
Region 29..269
/region_name="Mature chain"
/note="ELASTASE 2A."
Bond bond(58,74)
/bond_type="disulfide"
/note="BY SIMILARITY."
Site 73
/site_type="active"
/note="CHARGE RELAY SYSTEM (BY SIMILARITY)."
Site 121
/site_type="active"
/note="CHARGE RELAY SYSTEM (BY SIMILARITY)."
Bond bond(155,222)
/bond_type="disulfide"
/note="BY SIMILARITY."
Bond bond(186,202)
/bond_type="disulfide"
/note="BY SIMILARITY."
Bond bond(212,243)
/bond_type="disulfide"
/note="BY SIMILARITY."
Site 216
/site_type="active"
/note="CHARGE RELAY SYSTEM (BY SIMILARITY)."

ORIGIN 

1 mirtlllstl vagalscgdp typpyvtrvv ggeearpnsw pwqyslqyss
ngkwyhtcgg

61 slianswvlt aahcisssrt yrvglgrhnl yvaesgslav svskivvhkd
wnsnqiskgn

121 diallklanp vsltdkiqla clppagtilp nnypcyvtgw grlqtngavp
dvlgqgrllv

181 vdyatcsssa wwgssvktsm icaggdgvis scngdsggpl ncqasdgrwq
vhgivsfgsr

241 lgcnyyhkps vftrvsnyid winsviann

// 

NP_056933. pancreatic elasta ... [gi : 7705648]

LOCUS NP_056933 269 aa PRI 02-NOV-2000
DEFINITION pancreatic elastase IIB [Homo sapiens].
ACCESSION NP_056933
PID g7705648
VERSION NP_056933.1 GI:7705648
DBSOURCE REFSEQ: accession NM_015849.1
KEYWORDS 

SOURCE human . 

ORGANISM Homo sapiens

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 

Homo. 

REFERENCE 1 (residues 1 to 269)
AUTHORS Kawashima,I., Tani,T., Shimoda,K. and Takiguchi,Y.
TITLE Characterization of pancreatic elastase II cDNAs: two elastase II
mRNAs are expressed in human pancreas
JOURNAL DNA 6 (2), 163-172 (1987)
MEDLINE 87217962
COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final
NCBI review. The reference sequence was derived from M16653.1.

FEATURES Location/Qualifiers
source 1..269
/organism="Homo sapiens"
/db_xref="taxon:9606"
/chromosome="12"
/map="12"
Protein 1..269
/product="pancreatic elastase IIB"
sig_peptide 1..16
/note="pancreatic elastase IIB signal peptide"
Region 28..262
/region_name="Trypsin-like serine protease"
/db_xref="CDD:Tryp_SPc"
/note="Tryp_SPc"
mat_peptide 29..269
/product="pancreatic elastase IIB mature peptide"
Region 31..262
/region_name="Trypsin"
/db_xref="CDD:pfam00089"
/note="trypsin"
CDS 1..269
/gene="LOC51032"
/db_xref="LocusID:51032"
/coded_by="NM_015849.1:26..835"



ORIGIN

1 mirtlllstl vagalscgvs tyapdmsrml ggeearpnsw pwqvslqyss
ngqwyhtcgg

61 slianswvlt aahcisssri yrvmlgqhnl yvaesgslav svskivvhkd
wnsnqvskgn

121 diallklanp vsltdkiqla clppagtilp nnypcyvtgw grlqtngalp
ddlkqgrllv

181 vdyatcsssg wwgstvktnm icaggdgvic tcngdsggpl ncqasdgrwe
vhgigsltsv

241 lgcnyyykps iftrvsnynd winsviann



PRAME

LOCUS NM_006115 2148 bp mRNA PRI 19-JUN-2001
DEFINITION Homo sapiens preferentially expressed antigen in melanoma (PRAME),
mRNA.
ACCESSION NM_006115
VERSION NM_006115.1 GI:5174640
KEYWORDS
SOURCE human.
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2148)
AUTHORS Ikeda,H., Lethe,B., Lehmann,F., Van Baren,N., Baurain,J.-F., De
Smet,C., Chambost,H., Vitale,M., Moretta,A., Boon,T. and Coulie,P.G.
TITLE Characterization of an antigen that is recognized on a melanoma
showing partial HLA loss by CTL expressing an NK inhibitor receptor
JOURNAL Immunity 6 (2), 199-208 (1997)
MEDLINE 97199265
REFERENCE 2 (bases 1 to 2148)
AUTHORS Williams JM, Chen GC, Zhu L and Rest RF.
TITLE Using the yeast two-hybrid system to identify human epithelial cell
proteins that bind gonococcal Opa proteins: intracellular gonococci
bind pyruvate kinase via their Opa proteins and require host
pyruvate for growth
JOURNAL Mol. Microbiol. 27 (1), 171-186 (1998)
MEDLINE 98125741
PUBMED 9466265
REFERENCE 3 (bases 1 to 2148)
AUTHORS vanBaren,N., Chambost,H., Ferrant,A., Michaux,L., Ikeda,H.,
Millard,I., Olive,D., Boon,T. and Coulie,P.G.
TITLE PRAME, a gene encoding an antigen recognized on a human melanoma by
cytolytic T cells, is expressed in acute leukaemia cells
JOURNAL Br. J. Haematol. 102 (5), 1376-1379 (1998)
MEDLINE 98423996
PUBMED 9753074
REFERENCE 4 (bases 1 to 2148)
AUTHORS Dunham I, Shimizu N, Roe BA, Chissoe S, Hunt AR, Collins JE,
Bruskiewich R, Beare DM, Clamp M, Smink LJ, Ainscough R, Almeida
JP, Babbage A, Bagguley C, Bailey J, Barlow K, Bates KN, Beasley O,
Bird CP, Blakey S, Bridgeman AM, Buck D, Burgess J, Burrill WD,
O'Brien KP and et al.
TITLE The DNA sequence of human chromosome 22
JOURNAL Nature 402 (6761), 489-495 (1999)
MEDLINE 20057165
PUBMED 10591208
REMARK Erratum:[[published erratum appears in Nature 2000 Apr
20;404(6780):904]]
COMMENT REVIEWED REFSEQ: This record has been curated by NCBI staff. The
reference sequence was derived from U65011.1.
Summary: The protein encoded by this gene has a 509 amino acid
antigen, lacking a signal sequence, and recognized on a human
melanoma cell line by a T-lymphocyte clone. A significant level of
this mRNA is detected in normal testis as well as in many
melanomas, non-small cell lung carcinomas, sarcomas, head and neck
tumors and renal carcinomas. The encoded protein is expressed
predominantly in acute leukemias carrying chromosomal abnormalities
such as translocation t(8:21), which fuses the AML1 and ETO genes.
Its expression shares several characteristics with the expression
patterns of MAGE, BAGE, and GAGE gene families, all of which are
expressed in tumors. This protein is expressed in a higher
proportion of samples than genes of the MAGE, BAGE, and GAGE
families, and of these four groups, only this protein is expressed
by acute myeloid leukemias.
COMPLETENESS: complete on the 3' end.
FEATURES Location/Qualifiers

source 1..2148
/organism="Homo sapiens"
/db_xref="taxon:9606"
/chromosome="22"
/map="22q11.22"
/haplotype="A24 A28 B13 B44 Cw6 Cw7"
gene 1..2148
/gene="PRAME"
/note="MAPE; OIP4"
/db_xref="LocusID:23532"
/db_xref="MIM:606021"
CDS 236..1765
/gene="PRAME"
/note="melanoma antigen preferentially expressed in
tumors; Opa-interacting protein OIP4"
/codon_start=1
/db_xref="LocusID:23532"
/db_xref="MIM:606021"
/product="preferentially expressed antigen of melanoma"
/protein_id="NP_006106.1"
/db_xref="GI:5174641"
/translation="MERRRLWGSIQSRYISMSVWTSPRRLVELAGQSLLKDEALAIAALELLPREL
FPPLFMAAFDGRHSQTLKAMVQAWPFTCLPLGVLMKGQHLHLETFKAVLDGLDVLLAQEVRPRRWK
LQVLDLRKNSHQDFWTVWSGNRASLYSFPEPEAAQPMTKKRKVDGLSTEAEQPFIPVEVLVDLFLK
EGACDELFSYLIEKVKRKKNVLRLCCKKLKIFAMPMQDIKMILKMVQLDSIEDLEVTCTWKLPTLA
KFSPYLGQMINLRRLLLSHIHASSYISPEKEEQYIAQFTSQFLSLQCLQALYVDSLFFLRGRLDQL
LRHVMNPLETLSITNCRLSEGDVMHLSQSPSVSQLSVLSLSGVMLTDVSPEPLQALLERASATLQD
LVFDECGITDDQLLALLPSLSHCSQLTTLSFYGNSISISALQSLLQHLIGLSNLTHVLYPVPLESY
EDIHGTLHLERLAYLHARLRELLCELGRPSMVWLSANPCPHCGDRTFYDPEPILCPCFMPN"
variation 254
/allele-
/allele="T"
/db_xref="dbSNP:1129172"
variation 698
/allele="C"
/allele="A"
/db_xref="dbSNP:1063107"
variation 1001
/allele="C"
/allele="T"
/db_xref="dbSNP:2229695"
variation 1498
/allele="T"
/allele="C"
/db_xref="dbSNP:2229696"
variation 1762
/allele="C"
/allele="T"
/db_xref="dbSNP:13604"
variation 1952
/allele="A"
/allele="G"
/db_xref="dbSNP:7104"
variation 1984
/allele="A"
/allele="C"
/db_xref="dbSNP:8405"
variation 2043
/allele="T"
/allele="A"
/db_xref="dbSNP:1804685"
polyA_signal 2108..2113
polyA_site 2130
BASE COUNT 534 a 548 c 558 g 508 t

ORIGIN

1 gcttcagggt acagctcccc cgcagccaga agccgggcct gcagcccctc
agcaccgctc

61 cgggacaccc cacccgcttc ccaggcgtga cctgtcaaca gcaacttcgc
qqtgtgcrtga

121 actctctgag gaaaaaccat tttgattatt actctcagac gtgcgtggca
acaagtgact

181 gagacctaga aatccaagcg ttggaggtcc tgaggccagc ctaagtcgct
tcaaaatgga

241 acgaaggcgt ttgtggggtt ccattcagag ccgatacatc agcatgagtg
tgtggacaag

301 cccacggaga cttgtggagc tggcagggca gagcctgctg aaggatgagg
ccctggccat

361 tgccgccctg gagttgctgc ccagggagct cttcccgcca ctcttcatgg
cagcctttga

421 cgggagacac agccagaccc tgaaggcaat ggtgcaggcc tggcccttca
cctgcctccc

481 tctgggagtg ctgatgaagg gacaacatct tcacctggag accttcaaag
ctgtgcttga

541 tggacttgat gtgctccttg cccaggaggt tcgccccagg aggtggaaac
ttcaagtgct

601 ggatttacgg aagaactctc atcaggactt ctggaccgca tggtctggaa
acagggccag

661 tctgtactca tttccagagc cagaagcagc tcagcccatg acaaagaagc
gaaaagtaga

721 tggtttgagc acagaggcag agcagccctt cattccagta gaggtgctcg
tagacctgtt

781 cctcaaggaa ggtgcctgtg atgaattgtt ctcctacctc attgagaaag
tgaagcgaaa

841 gaaaaatgta ctacgcctgt gctgtaagaa gctgaagatt tttgcaatgc
ccatgcagga

901 tatcaagatg atcctgaaaa tggtgcagct ggactctatt gaagatttgg
aagtgacttg

961 tacctggaag ctacccacct tggcgaaatt ttctccttac ctgggccaga
tgattaatct

1021 gcgtagactc ctcctctccc acatccatgc atcttcctac atttccccgg
agaaggaaga

1081 gcagtatatc gcccagttca cctctcagtt cctcagtctg cagtgcctgc
aggctctcta

1141 tgtggactct ttatttttcc ttagaggccg cctggatcag ttgctcaggc
acgtgatgaa

1201 ccccttggaa accctctcaa taactaactg ccggctttcg gaaggggatg
tgatgcatct

1261 gtcccagagt cccagcgtca gtcagctaag tgtcctgagt ctaagtgggg
tcatgctgac

1321 cgatgtaagt cccgagcccc tccaagctct gctggagaga gcctctgcca
ccctccagga

1381 cctggtcttt gatgagtgtg ggatcacgga tgatcagctc cttgccctcc
tgccttccct

1441 gagccactgc tcccagctta caaccttaag cttctacggg aattccatct
ccatatctgc

1501 cttgcagagt ctcctgcagc acctcatcgg gctgagcaat ctgacccacg
tgctgtatcc

1561 tgtccccctg gagagttatg aggacatcca tggtaccctc cacctggaga
ggcttgccta

1621 tctgcatgcc aggctcaggg agttgctgtg tgagttgggg cggcccagca
tggtctggct

1681 tagtgccaac ccctgtcctc actgtgggga cagaaccttc tatgacccgg
agcccatcct

1741 gtgcccctgt ttcatgccta actagctggg tgcacatatc aaatgcttca
ttctgcatac

1801 ttggacacta aagccaggat gtgcatgcat cttgaagcaa caaagcagcc
acagtttcag

1861 acaaatgttc agtgtgagtg aggaaaacat gttcagtgag gaaaaaacat
tcagacaaat

1921 gttcagtgag gaaaaaaagg ggaagttggg gataggcaga tgttgacttg
aggagttaat

1981 gtgatctttg gggagataca tcttatagag ttagaaatag aatctgaatt
tctaaaggga

2041 gattctggct tgggaagtac atgtaggagt taatccctgt gtagactgtt
gtaaagaaac

2101 tgttgaaaat aaagagaagc aatgtgaagc aaaaaaaaaa aaaaaaaa
ED-B domain of Fibronectin

LOCUS HSFIBEDB 2823 bp DNA linear PRI 09-AUG-1999
DEFINITION Human fibronectin gene ED-B region.
ACCESSION X07717
VERSION X07717.1 GI:31406
KEYWORDS alternate splicing; fibronectin.
SOURCE human.
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2823)
AUTHORS Paolella,G., Henchcliffe,C., Sebastio,G. and Baralle,F.E.
TITLE Sequence analysis and in vivo expression show that alternative
splicing of ED-B and ED-A regions of the human fibronectin gene are
independent events
JOURNAL Nucleic Acids Res. 16 (8), 3545-3557 (1988)
MEDLINE 88233940
FEATURES Location/Qualifiers

source 1..2823
/organism="Homo sapiens"
/db_xref="taxon:9606"
/clone="MA10"
exon 1..104
/number=1
/product="fibronectin"
CDS join(<2..104,1375..1647,2758..>2823)
/codon_start=1
/product="fibronectin"
/protein_id="CAB52437.1"
/db_xref="GI:5725425"
/translation="CTFDNLSPGLEYNVSVYTVKDDKESVPISDTIIPEVPQLTDLSF
VDITDSSIGLRWTPLNSSTIIGYRITVVAAGEGIPIFEDFVDSSVGYYTVTGLEPGID
YDISVITLINGGESAPTTLTQQTAVPPPTDLRFTNIGPDTMRVTW"
intron 105..1374
/number=1
exon 1375..1647
/note="ED-B exon"
/number=2
/product="fibronectin"
intron 1648..2757
/number=2
exon 2758..2823
/number=3
/product="fibronectin"
BASE COUNT 824 a 556 c 528 g 915 t

ORIGIN

1 ctgcactttt gataacctga gtcccggcct ggagtacaat gtcagtgttt 
acactgtcaa 

61 ggatgacaag gaaagtgtcc ctatctctga taccatcatc ccaggtaata 
gaaaataagc 

121 tgctatcctg agagtgacat tccaataaga gtggggatta gcatcttaat 
ccccagatgc 

181 ttaagggtgt caactatatt tgggatttaa ttccgatctc ccagctgcac 
tttccaaaac 

241 caagaagtca aagcagcgat ttggacaaaa tgcttgctgt taacactgct 
ttactgtctg 

301 tgcttcactg ggatgctgtg tgttgcagcg agtatgtaat ggagtggcag
ccatggcttt 

361 aactctgtat tgtctgctca catggaagta tgactaaaac actgtcacgt 
gtctgtactc 

421 agtactgata ggctcaaagt aatatggtaa atgcatccca tcagtacatt 
tctgcccgat 

481 tttacaatcc atatcaattt ccaacagctg cctatttcat cttgcagttt 
caaatccttc 

541 tttttgaaaa ttggatttta aaaaaaagtt aagtaaaagt cacaccttca 

gggttgttct 

601 ttcttgtggc cttgaaagac aacattgcaa aggcctgtcc taaggatagg 
cttgtttgtc 
661 cattgggtta taacataatg aaagcattgg acagatcgtg tccccctttg
gactcttcag

721 tagaatgctt ttactaacgc taattacatg ttttgattat gaatgaacct
aaaatagtgg

781 caatggcctt aacctaggcc tgtctttcct cagcctgaat gtgcttttga
atggcacatt

841 tcacaccata cattcataat gcattagcgt tatggccatg atgttgtcat
gagttttgta

901 tgggagaaaa aaaatcaatt tatcacccat ttattatttt ttccggttgt
tcatgcaagc

961 ttattttcta ctaaaacagt tttggaatta ttaaaagcat tgctgatact
tacttcagat

1021 attatgtcta ggctctaaga atggtttcga catcctaaac agccatatga
tttttaggaa

1081 tctgaacagt tcaaattgta ccctttaagg atgttttcaa aatgtaaaaa
atatatatat

1141 atatatatat tccctaaaag aatattcctg tttattcttc tagggaagca
aactgttcat

1201 gatgcttagg aagtcttttc agagaattta aaacagattg catattacca
tcattgcttt

1261 aacattccac caattttact actagtaacc tgatatacac tgctttattt
tttcctcttt

1321 ttttccctct attttccttt tgcctccccc tccctttgct ttgtaactca
atagaggtgc

1381 cccaactcac tgacctaagc tttgttgata taaccgattc aagcatcggc
ctgaggtgga

1441 ccccgctaaa ctcttccacc attattgggt accgcatcac agtagttgcg
gcaggagaag

1501 gtatccctat ttttgaagat tttgtggact cctcagtagg atactacaca
gtcacagggc

1561 tggagccggg cattgactat gatatcagcg ttatcactct cattaatggc
ggcgagagtg

1621 cccctactac actgacacaa caaacgggtg aattttgaaa acttctgcgt
ttgagacata

1681 gatggtgttg catgctgcca ccagttactc cggttaaata tggatgtttc
atgggggaag

1741 tcagcaattg gccaaagatt cagataggtg gaattggggg gataaggaat
caaatgcatc

1801 tgctaaactg attggagaaa aacacatgca atatcttcag tacactctca
tttaaaccac 

1861 aagtagatat aaagcctaga gaaatacaga tgtctgctct gttaaatata 
aaatagcaaa 

1921 tgttcattca atttgaagac ctagaatttt tcttcttaaa taccaaacac

gaataccaaa 

1981 ttgcgtaagt accaattgat aagaatatat caccaaaatg taccatcatg 
ctcttccttc 

2041 taccctttga taaactctac catgctcctt ctttgtagct aaaaacccat 
caaaatttag

2101 ggtagagtgg atgggcattg ttttgaggta ggagaaaagt aaacttggga 
ccattctagg 

2161 ttttgttgct gtcactaggt aaagaaacac ctctttaacc acagtctggg 
gacaagcatg 

2221 caacatttta aaggttctct gctgtgcatg ggaaaagaaa catgctgaga
accaatttgc

2281 atgaacatgt tcacttgtaa gtagaattca ctgaatggaa ctgtagctct 
agatatctca 

2341 catgggggga agtttaggac cctcttgtct ttttgtctgt gtgcatgtat 
ttctttgtaa

2401 agtactgcta tgtttctctt tgctgtgtgg caacttaagc ctcttcggcc 
tgggataaaa 

2461 taatctgcag tggtattaat aatgtacata aagtcaacat atttgaaagt 
agattaaaat 

2521 cttttttaaa tatatcaatg atggcaaaaa ggttaaaggg ggcctaacag

tactgtgtgt 

2581 agtgttttat ttttaacagt agtacactat aacttaaaat agacttagat 
tagactgttt 

2641 gcatgattat gattctgttt cctttatgca tgaaatattg attttacctt 
tccagctact 

2701 tcgttagctt taattttaaa atacattaac tgagtcttcc ttcttgttcg 
aaaccagctg 

2761 ttcctcctcc cactgacctg cgattcacca acattggtcc agacaccatg 
cgtgtcacct 
2821 ggg 

//

CEA

LOCUS       NM_004363    2974 bp    mRNA    linear    PRI 28-NOV-2000
DEFINITION  Homo sapiens carcinoembryonic antigen-related cell adhesion
            molecule 5 (CEACAM5), mRNA.
ACCESSION   NM_004363
VERSION     NM_004363.1  GI:11386170
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2974)
  AUTHORS   Oikawa S, Nakazato H and Kosaki G.
  TITLE     Primary structure of human carcinoembryonic antigen (CEA) deduced
            from cDNA sequence
  JOURNAL   Biochem. Biophys. Res. Commun. 142 (2), 511-518 (1987)
  MEDLINE   87128144
   PUBMED   3814146
REFERENCE   2  (bases 1 to 2974)
  AUTHORS   Zimmermann W, Weber B, Ortlieb B, Rudert F, Schempp W, Fiebig HH,
            Shively JE, von Kleist S and Thompson JA.
  TITLE     Chromosomal localization of the carcinoembryonic antigen gene
            family and differential expression in various tumors
  JOURNAL   Cancer Res. 48 (9), 2550-2554 (1988)
  MEDLINE   88184584
   PUBMED   3356015
REFERENCE   3  (bases 1 to 2974)
  AUTHORS   Barnett,T., Goebel,S.J., Nothdurft,M.A. and Elting,J.J.
  TITLE     carcinoembryonic antigen family: characterization of cDNAs coding
            for NCA and CEA and suggestion of nonrandom sequence variation in
            their conserved loop-domains
  JOURNAL   Genomics 3 (1), 59-66 (1988)
  MEDLINE   89122014
REFERENCE   4  (bases 1 to 2974)
  AUTHORS   Barnett T and Zimmermann W.
  TITLE     Workshop report: proposed nomenclature for the carcinoembryonic
            antigen (CEA) gene family
  JOURNAL   Tumour Biol. 11 (1-2), 59-63 (1990)
  MEDLINE   90176333
   PUBMED   2309067
REFERENCE   5  (bases 1 to 2974)
  AUTHORS   Schrewe H, Thompson J, Bona M, Hefta LJ, Maruya A, Hassauer M,
            Shively JE, von Kleist S and Zimmermann W.
  TITLE     Cloning of the complete gene for carcinoembryonic antigen: analysis
            of its promoter indicates a region conveying cell type-specific
            expression
  JOURNAL   Mol. Cell. Biol. 10 (6), 2738-2748 (1990)
  MEDLINE   90258861
   PUBMED   2342461
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from M29540.1.
FEATURES             Location/Qualifiers
     source          1..2974
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="19"
                     /map="19q13.1-q13.2"
     gene            1..2974
                     /gene="CEACAM5"
                     /note="CEA; CD66e"
                     /db_xref="LocusID:1048"
                     /db_xref="MIM:114890"
     CDS             115..2223
                     /gene="CEACAM5"
                     /note="carcinoembryonic antigen"
                     /codon_start=1
                     /db_xref="LocusID:1048"
                     /db_xref="MIM:114890"
                     /product="carcinoembryonic antigen-related cell adhesion
                     molecule 5"
                     /protein_id="NP_004354.1"
                     /db_xref="GI:11386171"
                     /translation="MESPSAPPHRWCIPWQRLLLTASLLTFWNPPTTAKLTIESTPFN
                     VAEGKEVLLLVHNLPQHLFGYSWYKGERVDGNRQIIGYVIGTQQATPGPAYSGREIIY
                     PNASLLIQNIIQNDTGFYTLHVIKSDLVNEEATGQFRVYPELPKPSISSNNSKPVEDK
                     DAVAFTCEPETQDATYLWWVNNQSLPVSPRLQLSNGNRTLTLFNVTRNDTASYKCETQ
                     NPVSARRSDSVILNVLYGPDAPTISPLNTSYRSGENLNLSCHAASNPPAQYSWFVNGT
                     FQQSTQELFIPNITVNNSGSYTCQAHNSDTGLNRTTVTTITVYAEPPKPFITSNNSNP
                     VEDEDAVALTCEPEIQNTTYLWWVNNQSLPVSPRLQLSNDNRTLTLLSVTRNDVGPYE
                     CGIQNELSVDHSDPVILNVLYGPDDPTISPSYTYYRPGVNLSLSCHAASNPPAQYSWL
                     IDGNIQQHTQELFISNITEKNSGLYTCQANNSASGHSRTTVKTITVSAELPKPSISSN
                     NSKPVEDKDAVAFTCEPEAQNTTYLWWVNGQSLPVSPRLQLSNGNRTLTLFNVTRNDA
                     RAYVCGIQNSVSANRSDPVTLDVLYGPDTPIISPPDSSYLSGANLNLSCHSASNPSPQ
                     YSWRINGIPQQHTQVLFIAKITPNNNGTYACFVSNLATGRNNSIVKSITVSASGTSPG
                     LSAGATVGIMIGVLVGVALI"
     misc_feature    595..780
                     /note="IG; Region: Immunoglobulin"
     misc_feature    868..1026
                     /note="IG; Region: immunoglobulin"
     misc_feature    1129..1314
                     /note="IG; Region: Immunoglobulin"
     misc_feature    1399..1566
                     /note="IG; Region: Immunoglobulin"
     misc_feature    1930..2091
                     /note="IG; Region: Immunoglobulin"
BASE COUNT      840 a    847 c    613 g    674 t

ORIGIN 

1 ctcagggcag agggaggaag gacagcagac cagacagtca cagcagcctt 
gacaaaacgt 

61 tcctggaact caagctcttc tccacagagg aggacagagc agacagcaga 
gaccatggag 

121 tctccctcgg cccctcccca cagatggtgc atcccctggc agaggctcct 
gctcacagcc 

181 tcacttctaa ccttctggaa cccgcccacc actgccaagc tcactattga 
atccacgccg 

241 ttcaatgtcg cagaggggaa ggaggtgctt ctacttgtcc acaatctgcc 
ccagcatctt 

301 tttggctaca gctggtacaa aggtgaaaga gtggatggca accgtcaaat 
tataggatat 

361 gtaataggaa ctcaacaagc taccccaggg cccgcataca gtggtcgaga 
gataatatac 

421 cccaatgcat ccctgctgat ccagaacatc atccagaatg acacaggatt 
ctacacccta 

481 cacgtcataa agtcagatct tgtgaatgaa gaagcaactg gccagttccg 
ggtatacccg 

541 gagctgccca agccctccat ctccagcaac aactccaaac ccgtggagga 
caaggatgct 

601 gtggccttca cctgtgaacc tgagactcag gacgcaacct acctgtggtg 
ggtaaacaat 

661 cagagcctcc cggtcagtcc caggctgcag ctgtccaatg gcaacaggac 
cctcactcta 

721 ttcaatgtca caagaaatga cacagcaagc tacaaatgtg aaacccagaa 
cccagtgagt 

781 gccaggcgca gtgattcagt catcctgaat gtcctctatg gcccggatgc 
ccccaccatt 

841 tcccctctaa acacatctta cagatcaggg gaaaatctga acctctcctg 
ccacgcagcc 

901 tctaacccac ctgcacagta ctcttggttt gtcaatggga ctttccagca 
atccacccaa 

961 gagctcttta tccccaacat cactgtgaat aatagtggat cctatacgtg 
ccaagcccat 

1021 aactcagaca ctggcctcaa taggaccaca gtcacgacga tcacagtcta 
tgcagagcca 

1081 cccaaaccct tcatcaccag caacaactcc aaccccgtgg aggatgagga 
tgctgtagcc 

1141 ttaacctgtg aacctgagat tcagaacaca acctacctgt ggtgggtaaa 
taatcagagc 

1201 ctcccggtca gtcccaggct gcagctgtcc aatgacaaca ggaccctcac 
tctactcagt 

1261 gtcacaagga atgatgtagg accctatgag tgtggaatcc agaacgaatt 
aagtgttgac 

1321 cacagcgacc cagtcatcct gaatgtcctc tatggcccag acgaccccac 
catttccccc 

1381 tcatacacct attaccgtcc aggggtgaac ctcagcctct cctgccatgc 
agcctctaac 

1441 ccacctgcac agtattcttg gctgattgat gggaacatcc agcaacacac 
acaagagctc 

1501 tttatctcca acatcactga gaagaacagc ggactctata cctgccaggc 
caataactca 

1561 gccagtggcc acagcaggac tacagtcaag acaatcacag tctctgcgga 
gctgcccaag 

1621 ccctccatct ccagcaacaa ctccaaaccc gtggaggaca aggatgctgt 
ggccttcacc 

1681 tgtgaacctg aggctcagaa cacaacctac ctgtggtggg taaatggtca 
gagcctccca 

1741 gtcagtccca ggctgcagct gtccaatggc aacaggaccc tcactctatt 
caatgtcaca 

1801 agaaatgacg caagagccta tgtatgtgga atccagaact cagtgagtgc 
aaaccgcagt 

1861 gacccagtca ccctggatgt cctctatggg ccggacaccc ccatcatttc 
ccccccagac 

1921 tcgtcttacc tttcgggagc gaacctcaac ctctcctgcc actcggcctc 
taacccatcc 

1981 ccgcagtatt cttggcgtat caatgggata ccgcagcaac acacacaagt 
tctctttatc 

2041 gccaaaatca cgccaaataa taacgggacc tatgcctgtt ttgtctctaa 
cttggctact 

2101 ggccgcaata attccatagt caagagcatc acagtctctg catctggaac 
ttctcctggt 

2161 ctctcagctg gggccactgt cggcatcatg attggagtgc tggttggggt 
tgctctgata 

2221 tagcagccct ggtgtagttt cttcatttca ggaagactga cagttgtttt 
gcttcttcct 

2281 taaagcattt gcaacagcta cagtctaaaa ttgcttcttt accaaggata 
tttacagaaa 

2341 agactctgac cagagatcga gaccatccta gccaacatcg tgaaacccca 
tctctactaa 

2401 aaatacaaaa atgagctggg cttggtggcg cgcacctgta gtcccagtta 
ctcgggaggc 

2461 tgaggcagga gaatcgcttg aacccgggag gtggagattg cagtgagccc 
agatcgcacc 

2521 actgcactcc agtctggcaa cagagcaaga ctccatctca aaaagaaaag 
aaaagaagac 

2581 tctgacctgt actcttgaat acaagtttct gataccactg cactgtctga 
gaatttccaa 

2641 aactttaatg aactaactga cagcttcatg aaactgtcca ccaagatcaa 
gcagagaaaa 

2701 taattaattt catgggacta aatgaactaa tgaggattgc tgattcttta 
aatgtcttgt 

2761 ttcccagatt tcaggaaact ttttttcttt taagctatcc actcttacag 
caatttgata 

2821 aaatatactt ttgtgaacaa aaattgagac atttacattt tctccctatg 
tggtcgctcc 

2881 agacttggga aactattcat gaatatttat attgtatggt aatatagtta 
ttgcacaagt 

2941 tcaataaaaa tctgctcttt gtataacaga aaaa 

// 



Her2/Neu

LOCUS       HUMHER2A    4530 bp    mRNA    linear    PRI 18-SEP-1995
DEFINITION  Human tyrosine kinase-type receptor (HER2) mRNA, complete cds.
ACCESSION   M11730
VERSION     M11730.1  GI:183986
KEYWORDS    tyrosine kinase.
SOURCE      Homo sapiens (clone: lambda-HER2-436) fetal cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4530)
  AUTHORS   Coussens,L., Yang-Feng,T.L., Liao,Y.-C., Chen,E., Gray,A.,
            McGrath,J., Seeburg,P.H., Libermann,T.A., Schlessinger,J.,
            Francke,U., Levinson,A. and Ullrich,A.
  TITLE     Tyrosine kinase receptor with extensive homology to EGF receptor
            shares chromosomal location with neu oncogene
  JOURNAL   Science 230 (4730), 1132-1139 (1985)
  MEDLINE   86070181
REFERENCE   2  (bases 1701 to 1719)
  AUTHORS   Ullrich,A.
  JOURNAL   Unpublished (1988)
FEATURES             Location/Qualifiers
     source          1..4530
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /clone="lambda-HER2-436"
                     /dev_stage="fetal"
     mRNA            <1..4530
                     /product="HER2 mRNA"
     CDS             151..3918
                     /note="HER2 receptor"
                     /codon_start=1
                     /protein_id="AAA75493.1"
                     /db_xref="GI:306840"
                     /translation="MELAALCRWGLLLALLPPGAASTQVCTGTDMKLRLPASPETHLD
                     MLRHLYQGCQVVQGNLELTYLPTNASLSFLQDIQEVQGYVLIAHNQVRQVPLQRLRIV
                     RGTQLFEDNYALAVLDNGDPLNNTTPVTGASPGGLRELQLRSLTEILKGGVLIQRNPQ
                     LCYQDTILWKDIFHKNNQLALTLIDTNRSRACHPCSPMCKGSRCWGESSEDCQSLTRT
                     VCAGGCARCKGPLPTDCCHEQCAAGCTGPKHSDCLACLHFNHSGICELHCPALVTYNT
                     DTFESMPNPEGRYTFGASCVTACPYNYLSTDVGSCTLVCPLHNQEVTAEDGTQRCEKC
                     SKPCARVCYGLGMEHLREVRAVTSANIQEFAGCKKIFGSLAFLPESFDGDPASNTAPL
                     QPEQLQVFETLEEITGYLYISAWPDSLPDLSVFQNLQVIRGRILHNGAYSLTLQGLGI
                     SWLGLRSLRELGSGLALIHHNTHLCFVHTVPWDQLFRNPHQALLHTANRPEDECVGEG
                     LACHQLCARGHCWGPGPTQCVNCSQFLRGQECVEECRVLQGLPREYVNARHCLPCHPE
                     CQPQNGSVTCFGPEADQCVACAHYKDPPFCVARCPSGVKPDLSYMPIWKFPDEEGACQ
                     PCPINCTHSCVDLDDKGCPAEQRASPLTSIVSAVVGILLVVVLGVVFGILIKRRQQKI
                     RKYTMRRLLQETELVEPLTPSGAMPNQAQMRILKETELRKVKVLGSGAFGTVYKGIWI
                     PDGENVKIPVAIKVLRENTSPKANKEILDEAYVMAGVGSPYVSRLLGICLTSTVQLVT
                     QLMPYGCLLDHVRENRGRLGSQDLLNWCMQIAKGMSYLEDVRLVHRDLAARNVLVKSP
                     NHVKITDFGLARLLDIDETEYHADGGKVPIKWMALESILRRRFTHQSDVWSYGVTVWE
                     LMTFGAKPYDGIPAREIPDLLEKGERLPQPPICTIDVYMIMVKCWMIDSECRPRFREL
                     VSEFSRMARDPQRFVVIQNEDLGPASPLDSTFYRSLLEDDDMGDLVDAEEYLVPQQGF
                     FCPDPAPGAGGMVHHRHRSSSTRSGGGDLTLGLEPSEEEAPRSPLAPSEGAGSDVFDG
                     DLGMGAAKGLQSLPTHDPSPLQRYSEDPTVPLPSETDGYVAPLTCSPQPEYVNQPDVR
                     PQPPSPREGPLPAARPAGATLERAKTLSPGKNGVVKDVFAFGGAVENPEYLTPQGGAA
                     PQPHPPPAFSPAFDNLYYWDQDPPERGAPPSTFKGTPTAENPEYLGLDVPV"
     old_sequence    1701..1719
                     /citation=[i]
BASE COUNT      922 a   1382 c   1346 g    880 t
ORIGIN      Chromosome 17q21-q22.

1 aattctcgag ctcgtcgacc ggtcgacgag ctcgagggtc gacgagctcg 

agggcgcgcg 

61 cccggccccc acccctcgca gcaccccgcg ccccgcgccc tcccagccgg 
gtccagccgg 

121 agccatgggg ccggagccgc agtgagcacc atggagctgg cggccttgtg 
ccgctggggg 

181 ctcctcctcg ccctcttgcc ccccggagcc gcgagcaccc aagtgtgcac 
cggcacagac 

241 atgaagctgc ggctccctgc cagtcccgag acccacctgg acatgctccg 
ccacctctac 

301 cagggctgcc aggtggtgca gggaaacctg gaactcacct acctgcccac 
caatgccagc 

361 ctgtccttcc tgcaggatat ccaggaggtg cagggctacg tgctcatcgc 
tcacaaccaa 

421 gtgaggcagg tcccactgca gaggctgcgg attgtgcgag gcacccagct 
ctttgaggac 

481 aactatgccc tggccgtgct agacaatgga gacccgctga acaataccac 
ccctgtcaca 

541 ggggcctccc caggaggcct gcgggagctg cagcttcgaa gcctcacaga 
gatcttgaaa 

601 ggaggggtct tgatccagcg gaacccccag ctctgctacc aggacacgat 
tttgtggaag 

661 gacatcttcc acaagaacaa ccagctggct ctcacactga tagacaccaa 
ccgctctcgg 

721 gcctgccacc cctgttctcc gatgtgtaag ggctcccgct gctggggaga 
gagttctgag 

781 gattgtcaga gcctgacgcg cactgtctgt gccggtggct gtgcccgctg 
caaggggcca 

841 ctgcccactg actgctgcca tgagcagtgt gctgccggct gcacgggccc 
caagcactct 

901 gactgcctgg cctgcctcca cttcaaccac agtggcatct gtgagctgca 
ctgcccagcc 

961 ctggtcacct acaacacaga cacgtttgag tccatgccca atcccgaggg 
ccggtataca 

1021 ttcggcgcca gctgtgtgac tgcctgtccc tacaactacc tttctacgga 
cgtgggatcc 

1081 tgcaccctcg tctgccccct gcacaaccaa gaggtgacag cagaggatgg 
aacacagcgg 

1141 tgtgagaagt gcagcaagcc ctgtgcccga gtgtgctatg gtctgggcat 
ggagcacttg 

1201 cgagaggtga gggcagttac cagtgccaat atccaggagt ttgctggctg 
caagaagatc 

1261 tttgggagcc tggcatttct gccggagagc tttgatgggg acccagcctc 
caacactgcc 

1321 ccgctccagc cagagcagct ccaagtgttt gagactctgg aagagatcac 
aggttaccta 

1381 tacatctcag catggccgga cagcctgcct gacctcagcg tcttccagaa 
cctgcaagta 

1441 atccggggac gaattctgca caatggcgcc tactcgctga ccctgcaagg 
gctgggcatc 

1501 agctggctgg ggctgcgctc actgagggaa ctgggcagtg gactggccct 
catccaccat 

1561 aacacccacc tctgcttcgt gcacacggtg ccctgggacc agctctttcg 
gaacccgcac 

1621 caagctctgc tccacactgc caaccggcca gaggacgagt gtgtgggcga 
gggcctggcc 

1681 tgccaccagc tgtgcgcccg agggcactgc tggggtccag ggcccaccca 
gtgtgtcaac 

1741 tgcagccagt tccttcgggg ccaggagtgc gtggaggaat gccgagtact 
gcaggggctc 

1801 cccagggagt atgtgaatgc caggcactgt ttgccgtgcc accctgagtg 
tcagccccag 

1861 aatggctcag tgacctgttt tggaccggag gctgaccagt gtgtggcctg 
tgcccactat 

1921 aaggaccctc ccttctgcgt ggcccgctgc cccagcggtg tgaaacctga 
cctctcctac 

1981 atgcccatct ggaagtttcc agatgaggag ggcgcatgcc agccttgccc 
catcaactgc 

2041 acccactcct gtgtggacct ggatgacaag ggctgccccg ccgagcagag 
agccagccct 

2101 ctgacgtcca tcgtctctgc ggtggttggc attctgctgg tcgtggtctt 
gggggtggtc 

2161 tttgggatcc tcatcaagcg acggcagcag aagatccgga agtacacgat 
gcggagactg 

2221 ctgcaggaaa cggagctggt ggagccgctg acacctagcg gagcgatgcc 
caaccaggcg 

2281 cagatgcgga tcctgaaaga gacggagctg aggaaggtga aggtgcttgg 
atctggcgct 

2341 tttggcacag tctacaaggg catctggatc cctgatgggg agaatgtgaa 
aattccagtg 

2401 gccatcaaag tgttgaggga aaacacatcc cccaaagcca acaaagaaat 
cttagacgaa 

2461 gcatacgtga tggctggtgt gggctcccca tatgtctccc gccttctggg 
catctgcctg 

2521 acatccacgg tgcagctggt gacacagctt atgccctatg gctgcctctt 
agaccatgtc 

2581 cgggaaaacc gcggacgcct gggctcccag gacctgctga actggtgtat 
gcagattgcc 

2641 aaggggatga gctacctgga ggatgtgcgg ctcgtacaca gggacttggc 
cgctcggaac 

2701 gtgctggtca agagtcccaa ccatgtcaaa attacagact tcgggctggc 
tcggctgctg 

2761 gacattgacg agacagagta ccatgcagat gggggcaagg tgcccatcaa 
gtggatggcg 

2821 ctggagtcca ttctccgccg gcggttcacc caccagagtg atgtgtggag 
ttatggtgtg 

2881 actgtgtggg agctgatgac ttttggggcc aaaccttacg atgggatccc 
agcccgggag 

2941 atccctgacc tgctggaaaa gggggagcgg ctgccccagc cccccatctg 
caccattgat 

3001 gtctacatga tcatggtcaa atgttggatg attgactctg aatgtcggcc 
aagattccgg 

3061 gagttggtgt ctgaattctc ccgcatggcc agggaccccc agcgctttgt 
ggtcatccag 

3121 aatgaggact tgggcccagc cagtcccttg gacagcacct tctaccgctc 
actgctggag 

3181 gacgatgaca tgggggacct ggtggatgct gaggagtatc tggtacccca 
gcagggcttc 

3241 ttctgtccag accctgcccc gggcgctggg ggcatggtcc accacaggca 
ccgcagctca 

3301 tctaccagga gtggcggtgg ggacctgaca ctagggctgg agccctctga 
agaggaggcc 

3361 cccaggtctc cactggcacc ctccgaaggg gctggctccg atgtatttga 
tggtgacctg 

3421 ggaatggggg cagccaaggg gctgcaaagc ctccccacac atgaccccag 
ccctctacag 

3481 cggtacagtg aggaccccac agtacccctg ccctctgaga ctgatggcta 
cgttgccccc 

3541 ctgacctgca gcccccagcc tgaatatgtg aaccagccag atgttcggcc 
ccagccccct 

3601 tcgccccgag agggccctct gcctgctgcc cgacctgctg gtgccactct 
ggaaagggcc 

3661 aagactctct ccccagggaa gaatggggtc gtcaaagacg tttttgcctt 
tgggggtgcc 

3721 gtggagaacc ccgagtactt gacaccccag ggaggagctg cccctcagcc 
ccaccctcct 

3781 cctgccttca gcccagcctt cgacaacctc tattactggg accaggaccc 
accagagcgg 

3841 ggggctccac ccagcacctt caaagggaca cctacggcag agaacccaga 
gtacctgggt 

3901 ctggacgtgc cagtgtgaac cagaaggcca agtccgcaga agccctgatg 
tgtcctcagg 

3961 gagcagggaa ggcctgactt ctgctggcat caagaggtgg gagggccctc 
cgaccacttc 

4021 caggggaacc tgccatgcca ggaacctgtc ctaaggaacc ttccttcctg 
cttgagttcc 

4081 cagatggctg gaaggggtcc agcctcgttg gaagaggaac agcactgggg 
agtctttgtg 

4141 gattctgagg ccctgcccaa tgagactcta gggtccagtg gatgccacag 
cccagcttgg 

4201 ccctttcctt ccagatcctg ggtactgaaa gccttaggga agctggcctg 
agaggggaag 

4261 cggccctaag ggagtgtcta agaacaaaag cgacccattc agagactgtc 
cctgaaacct 

4321 agtactgccc cccatgagga aggaacagca atggtgtcag tatccaggct 
ttgtacagag 

4381 tgcttttctg tttagttttt actttttttg ttttgttttt ttaaagacga 
aataaagacc 

4441 caggggagaa tgggtgttgt atggggaggc aagtgtgggg ggtccttctc 
cacacccact 

4501 ttgtccattt gcaaatatat tttggaaaac 

// 

SCP-1

LOCUS       HSSCP1PRT    3393 bp    mRNA    linear    PRI 26-FEB-1997
DEFINITION  H.sapiens mRNA for SCP1 protein.
ACCESSION   X95654
VERSION     X95654.1  GI:1212982
KEYWORDS    SCP1 gene.
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3393)
  AUTHORS   Meuwissen,R.L., Meerts,I., Hoovers,J.M., Leschot,N.J. and
            Heyting,C.
  TITLE     Human synaptonemal complex protein 1 (SCP1): isolation and
            characterization of the cDNA and chromosomal localization of the
            gene
  JOURNAL   Genomics 39 (3), 377-384 (1997)
  MEDLINE   97224467
REFERENCE   2  (bases 1 to 3393)
  AUTHORS   Meuwissen,R.J.L.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-FEB-1996) Dr. R.L.J. Meuwissen, Agricultural
            University, Genetics, Dreijenlaan 2, 6703 HA WAGENINGEN,
            NETHERLANDS
FEATURES             Location/Qualifiers
     source          1..3393
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="1p12-p13"
                     /tissue_type="testis"
     gene            95..3025
                     /gene="SCP1"
     CDS             95..3025
                     /gene="SCP1"
                     /codon_start=1
                     /evidence=experimental
                     /product="polypeptide of 976 aa"
                     /protein_id="CAA64956.1"
                     /db_xref="GI:1212983"
                     /db_xref="SWISS-PROT:Q15431"
                     /translation="MEKQKPFALFVPPRSSSSQVSAVKPQTLGGDSTFFKSFNKCTED
                     DLEFPFAKTNLSKNGENIDSDPALQKVNFLPVLEQVGNSDCHYQEGLKDSDLENSEGL
                     SRVFSKLYKEAEKIKKWKVSTEAELRQKESKLQENRKIIEAQRKAIQELQFGNEKVSL
                     KLEEGIQENKDLIKENNATRHLCNLLKETCARSAEKTKKYEYEREETRQVYMDLNNNI
                     EKMITAHGELRVQAENSRLEMHFKLKEDYEKIQHLEQEYKKEINDKEKQVSLLLIQIT
                     EKENKMKDLTFLLEESRDKVNQLEEKTKLQSENLKQSIEKQHHLTKELEDIKVSLQRS
                     VSTQKALEEDLQIATKTICQLTEEKETQMEESNKARAAHSFVVTEFETTVCSLEELLR
                     TEQQRLEKNEDQLKILTMELQKKSSELEEMTKLTNNKEVELEELKKVLGEKETLLYEN
                     KQFEKIAEELKGTEQELIGLLQAREKEVHDLEIQLTAITTSEQYYSKEVKDLKTELEN
                     EKLKNTELTSHCNKLSLENKELTQETSDMTLELKNQQEDINNNKKQEERMLKQIENLQ
                     ETETQLRNELEYVREELKQKRDEVKCKLDKSEENCNNLRKQVENKNKYIEELQQENKA
                     LKKKGTAESKQLNVYEIKVNKLELELESAKQKFGEITDTYQKEIEDKKISEENLLEEV
                     EKAKVIADEAVKLQKEIDKRCQHKIAEMVALMEKHKHQYDKIIEERDSELGLYKSKEQ
                     EQSSLRASLEIELSNLKAELLSVKKQLEIEREEKEKLKREAKENTATLKEKKDKKTQT
                     FLLETPEIYWKLDSKAVPSQTVSRNFTSVDHGISKDKRDYLWTSAKNTLSTPLPKAYT
                     VKTPTKPKLQQRENLNIPIEESKKKRKMAFEFDINSDSSETTDLLSMVSEEETLKTLY
                     RNNNPPASHLCVKTPKKAPSSLTTPGPTLKFGAIRKMREDRWAVIAKMDRKKKLKEAE
                     KLFV"
BASE COUNT     1440 a    483 c    648 g    822 t
ORIGIN

1 gccctcatag accgtttgtt gtagttcgcg tgggaacagc aacccacggt 
ttcccgatag 

61 ttcttcaaag atatttacaa ccgtaacaga gaaaatggaa aagcaaaagc 
cctttgcatt 

121 gttcgtacca ccgagatcaa gcagcagtca ggtgtctgcg gtgaaacctc 
agaccctggg 

181 aggcgattcc actttcttca agagtttcaa caaatgtact gaagatgatt 
tggagtttcc 

241 atttgcaaag actaatctct ccaaaaatgg ggaaaacatt gattcagatc 
ctgctttaca 

301 aaaagttaat ttcttgcccg tgcttgagca ggttggtaat tctgactgtc 
actatcagga 

361 aggactaaaa gactctgatt tggagaattc agagggattg agcagagtgt 
tttcaaaact 

421 gtataaggag gctgaaaaga taaaaaaatg gaaagtaagt acagaagctg 
aactgagaca 

481 gaaagaaagt aagttgcaag aaaacagaaa gataattgaa gcacagcgaa 
aagccattca 

541 ggaactgcaa tttggaaatg aaaaagtaag tttgaaatta gaagaaggaa 
tacaagaaaa 

601 taaagattta ataaaagaga ataatgccac aaggcattta tgtaatctac 
tcaaagaaac 

661 ctgtgctaga tctgcagaaa agacaaagaa atatgaatat gaacgggaag 
aaaccaggca 

721 agtttatatg gatctaaata ataacattga gaaaatgata acagctcatg 
gggaacttcg 

781 tgtgcaagct gagaattcca gactggaaat gcattttaag ttaaaggaag 
attatgaaaa 

841 aatccaacac cttgaacaag aatacaagaa ggaaataaat gacaaggaaa 
agcaggtatc 

901 actactattg atccaaatca ctgagaaaga aaataaaatg aaagatttaa 
catttctgct 

961 agaggaatcc agagataaag ttaatcaatt agaggaaaag acaaaattac 
agagtgaaaa 

1021 cttaaaacaa tcaattgaga aacagcatca tttgactaaa gaactagaag 
atattaaagt 

1081 gtcattacaa agaagtgtga gtactcaaaa ggctttagag gaagatttac 
agatagcaac 

1141 aaaaacaatt tgtcagctaa ctgaagaaaa agaaactcaa atggaagaat 
ctaataaagc 

1201 tagagctgct cattcgtttg tggttactga atttgaaact actgtctgca 
gcttggaaga 

1261 attattgaga acagaacagc aaagattgga aaaaaatgaa gatcaattga 
aaatacttac 

1321 catggagctt caaaagaaat caagtgagct ggaagagatg actaagctta 
caaataacaa 

1381 agaagtagaa cttgaagaat tgaaaaaagt cttgggagaa aaggaaacac 
ttttatatga 

1441 aaataaacaa tttgagaaga ttgctgaaga attaaaagga acagaacaag 
aactaattgg 

1501 tcttctccaa gccagagaga aagaagtaca tgatttggaa atacagttaa 
ctgccattac 

1561 cacaagtgaa cagtattatt caaaagaggt taaagatcta aaaactgagc 
ttgaaaacga 

1621 gaagcttaag aatactgaat taacttcaca ctgcaacaag ctttcactag 
aaaacaaaga 

1681 gctcacacag gaaacaagtg atatgaccct agaactcaag aatcagcaag 
aagatattaa 

1741 taataacaaa aagcaagaag aaaggatgtt gaaacaaata gaaaatcttc 
aagaaacaga 

1801 aacccaatta agaaatgaac tagaatatgt gagagaagag ctaaaacaga 
aaagagatga 

1861 agttaaatgt aaattggaca agagtgaaga aaattgtaac aatttaagga 
aacaagttga 

1921 aaataaaaac aagtatattg aagaacttca gcaggagaat aaggccttga 
aaaaaaaagg 

1981 tacagcagaa agcaagcaac tgaatgttta tgagataaag gtcaataaat 
tagagttaga 

2041 actagaaagt gccaaacaga aatttggaga aatcacagac acctatcaga 
aagaaattga 

2101 ggacaaaaag atatcagaag aaaatctttt ggaagaggtt gagaaagcaa 
aagtaatagc 

2161 tgatgaagca gtaaaattac agaaagaaat tgataagcga tgtcaacata 

aaatagctga 

2221 aatggtagca cttatggaaa aacataagca ccaatatgat aagatcattg 
aagaaagaga 

2281 ctcagaatta ggactttata agagcaaaga acaagaacag tcatcactga 

gagcatcttt 

2341 ggagattgaa ctatccaatc tcaaagctga acttttgtct gttaagaagc 

aacttgaaat 

2401 agaaagagaa gagaaggaaa aactcaaaag agaggcaaaa gaaaacacag 
ctactcttaa 

2461 agaaaaaaaa gacaagaaaa cacaaacatt tttattggaa acacctgaaa 

tttattggaa 

2521 attggattct aaagcagttc cttcacaaac tgtatctcga aatttcacat 
cagttgatca 

2581 tggcatatcc aaagataaaa gagactatct gtggacatct gccaaaaata 

ctttatctac 

2641 accattgcca aaggcatata cagtgaagac accaacaaaa ccaaaactac 

agcaaagaga 

2701 aaacttgaat atacccattg aagaaagtaa aaaaaagaga aaaatggcct 
ttgaatttga 

2761 tattaattca gatagttcag aaactactga tcttttgagc atggtttcag 

aagaagagac 

2821 attgaaaaca ctgtatagga acaataatcc accagcttct catctttgtg 
tcaaaacacc 

2881 aaaaaaggcc ccttcatctc taacaacccc tggacctaca ctgaagtttg 

gagctataag 

2941 aaaaatgcgg gaggaccgtt gggctgtaat tgctaaaatg gatagaaaaa 

aaaaactaaa 

3001 agaagctgaa aagttatttg tttaatttca gagaatcagt gtagttaagg 
agcctaataa 

3061 cgtgaaactt atagttaata ttttgttctt atttgccaga gccacatttt 

atctggaagt 

3121 tgagacttaa aaaatacttg catgaatgat ttgtgtttct ttatattttt 
agcctaaatg 

3181 ttaactacat attgtctgga aacctgtcat tgtattcaga taattagatg 
attatatatt 

3241 gttgttactt tttcttgtat tcatgaaaac tgtttttact aagttttcaa 
atttgtaaag 

3301 ttagcctttg aatgctagga atgcattatt gagggtcatt ctttattctt 

tactattaaa 

3361 atattttgga tgcaaaaaaa aaaaaaaaaa aaa 

// 




SSX-4

LOCUS       NM_005636    576 bp    mRNA    linear    PRI 10-DEC-2001
DEFINITION  Homo sapiens synovial sarcoma, X breakpoint 4 (SSX4), mRNA.
ACCESSION   NM_005636
VERSION     NM_005636.1  GI:5032122
KEYWORDS
SOURCE      human.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 576)
  AUTHORS   Gure,A.O., Tureci,O., Sahin,U., Tsang,S., Scanlan,M.J., Knuth,A.,
            Pfreundschuh,M., Old,L.J. and Chen,Y.T.
  TITLE     SSX: a multigene family with several members transcribed in normal
            testis and human cancer
  JOURNAL   Int. J. Cancer 72 (6), 965-971 (1997)
  MEDLINE   98021352
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from U90841.1.
FEATURES             Location/Qualifiers
     source          1..576
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /chromosome="X"
                     /map="Xp11.3"
     gene            1..576
                     /gene="SSX4"
                     /db_xref="LocusID:6759"
                     /db_xref="MIM:300326"
     CDS             1..567
                     /gene="SSX4"
                     /note="Kruppel-associated box containing SSX
                     /codon_start=1
                     /db_xref="LocusID:6759"
                     /db_xref="MIM:300326"
                     /product="synovial sarcoma, X breakpoint 4"
                     /protein_id="NP_005627.1"
                     /db_xref="GI:5032123"
                     /translation="MNGDDAFARRPRDDAQISEKLRKAFDDIAKYFSKKEWEKMKSSEKIVY
                     VYMKLNYEVMTKLGFKVTLPPFMRSKRAADFHGNDFGNDRNHRNQVERPQMTFG
                     SLQRIFPKIMPKKPAEEENGLKEVPEASGPQNDGKQLCPPGNPSTLEKINKTSGPKRG
                     KHAWTHRLRERKQLVVYEEISDPEEDDE"
     misc_feature    70..246
                     /note="KRAB; Region: krueppel associated box"
BASE COUNT      187 a    127 c    150 g    112 t
ORIGIN

1 atgaacggag acgacgcctt tgcaaggaga cccagggatg atgctcaaat 
atcagagaag 

61 ttacgaaagg ccttcgatga tattgccaaa tacttctcta agaaagagtg 
ggaaaagatg 

121 aaatcctcgg agaaaatcgt ctatgtgtat atgaagctaa actatgaggt 
catgactaaa 

181 ctaggtttca aggtcaccct cccacctttc atgcgtagta aacgggctgc 
agacttccac 

241 gggaatgatt ttggtaacga tcgaaaccac aggaatcagg ttgaacgtcc 
tcagatgact 

301 ttcggcagcc tccagagaat cttcccgaag atcatgccca agaagccagc 
agaggaagaa 
361 aatggtttga aggaagtgcc agaggcatct ggcccacaaa atgatgggaa 
acagctgtgc 

421 cccccgggaa atccaagtac cttggagaag attaacaaga catctggacc 
caaaaggggg 

481 aaacatgcct ggacccacag actgcgtgag agaaagcagc tggtggttta 

tgaagagatc 

541 agcgaccctg aggaagatga cgagtaactc ccctcg 

All patents and publications mentioned in the specification 
are indicative of the levels of those skilled in the art to which 
the invention pertains. All patents and publications are herein 
incorporated by reference to the same extent as if each individual 
publication was specifically and individually indicated to be 
incorporated by reference. 

The invention illustratively described herein suitably may be 
practiced in the absence of any element or elements, limitation or 
limitations which is not specifically disclosed herein. The terms 
and expressions which have been employed are used as terms of 
description and not of limitation, and there is no intention that 
the use of such terms and expressions indicates the exclusion 
of equivalents of the features shown and described or portions 
thereof. It is recognized that various modifications are possible 
within the scope of the invention claimed. Thus, it should be 
understood that although the present invention has been 
specifically disclosed by preferred embodiments and optional 
features, modification and variation of the concepts herein 
disclosed may be resorted to by those skilled in the art, and that 
such modifications and variations are considered to be within the 
scope of this invention as defined by the appended claims. 

WHAT IS CLAIMED IS: 

1. An isolated epitope, comprising a component selected from the group consisting of: 

(i) a polypeptide having the sequence as disclosed in TABLE 1; 

(ii) an epitope cluster comprising the polypeptide of (i); 

(iii) a polypeptide having substantial similarity to (i) or (ii); 

(iv) a polypeptide having functional similarity to any of (i) through (iii); and 

(v) a nucleic acid encoding the polypeptide of any of (i) through (iv). 

2. The epitope of claim 1, wherein the epitope is immunologically active. 

3. The epitope of claim 1, wherein the polypeptide is less than about 30 amino acids in length. 

4. The epitope of claim 1, wherein the polypeptide is 8 to 10 amino acids in length. 

5. The epitope of claim 1, wherein the substantial or functional similarity comprises addition of at least one amino acid. 

6. The epitope of claim 5, wherein the at least one additional amino acid is at an N-terminus of the polypeptide. 

7. The epitope of claim 1, wherein the substantial or functional similarity comprises a substitution of at least one amino acid. 

8. The epitope of claim 1, the polypeptide having affinity to an HLA-A2 molecule. 

9. The epitope of claim 8, wherein the affinity is determined by an assay of binding. 

10. The epitope of claim 8, wherein the affinity is determined by an assay of restriction of epitope recognition. 

11. The epitope of claim 8, wherein the affinity is determined by a prediction algorithm. 

12. The epitope of claim 1, the polypeptide having affinity to an HLA-B7 or HLA-B51 molecule. 

13. The epitope of claim 1, wherein the polypeptide is a housekeeping epitope. 

14. The epitope of claim 1, wherein the polypeptide corresponds to an epitope displayed on a tumor cell. 

15. The epitope of claim 1, wherein the polypeptide corresponds to an epitope displayed on a neovasculature cell. 

16. The epitope of claim 1, wherein the peptide is an immune epitope. 

17. The epitope of claim 1 wherein the epitope is a nucleic acid. 

18. A pharmaceutical composition comprising the peptide of claim 1 and a pharmaceutically acceptable adjuvant, carrier, diluent, or excipient. 

1 9. The composition of claim 1 8, where the adjuvant is a polynucleotide. 
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20. The composition of claim 19 wherein the polynucleotide comprises a dinucleotide. 

2 1 . The composition of claim 20 wherein the dinucleotide is CpG. 

22. The composition of claim 18, wherein the adjuvant is encoded by a polynucleotide. 

23. The composition of claim 18 wherein the adjuvant is a cytokine. 
5 24. The composition of claim 23 wherein the cytokine is GM-CSF. 

25. The composition of claim 18 further comprising a professional antigen-presenting 
cell (pAPC). 

26. The composition of claim 25, wherein the pAPC is a dendritic cell. 

27. The composition of claim 18, further comprising a second epitope. 

1 0 28. The composition of claim 27, wherein the second epitope is a polypeptide. 

29. The composition of claim 27, wherein the second epitope is a nucleic acid. 

30. The composition of claim 27, wherein the second epitope is a housekeeping 

epitope. 

3 1 . The composition of claim 27, wherein the second epitope is an immune epitope. 

15 32. A pharmaceutical composition comprising the nucleic acid of claim 1 and a 

pharmaceutically acceptable adjuvant, carrier, diluent, or excipient. 

33. A recombinant construct comprising the nucleic acid of Claim 1. 

34. The construct of claim 33, further comprising a plasmid, a viral vector, or an 
artificial chromosome. 

20 35. The construct of claim 33, further comprising a sequence encoding at least one 

feature selected from the group consisting of a second epitope, an IRES, an ISS, an MS, and 
ubiquitin. 

36. A purified antibody that specifically binds to the epitope of claim 1. 

37. A purified antibody that specifically binds to a peptide-MHC protein complex
comprising the epitope of claim 1.

38. The antibody of claim 36 or claim 37, wherein the antibody is a monoclonal 
antibody. 

39. A multimeric MHC-peptide complex comprising the epitope of claim 1.

40. An isolated T cell expressing a T cell receptor specific for an MHC-peptide
complex, the complex comprising the epitope of claim 1.

41. The T cell of claim 40, produced by an in vitro immunization.

42. The T cell of claim 40, isolated from an immunized animal. 

43. A T cell clone comprising the T cell of claim 40.

44. A polyclonal population of T cells comprising the T cell of claim 40. 

45. A pharmaceutical composition comprising the T cell of claim 40 and a
pharmaceutically acceptable adjuvant, carrier, diluent, or excipient.

46. An isolated protein molecule comprising the binding domain of a T cell receptor
specific for an MHC-peptide complex, the complex comprising the epitope of claim 1.

47. The protein of claim 46, wherein the protein is multivalent.

48. An isolated nucleic acid encoding the protein of claim 46. 

49. A recombinant construct comprising the nucleic acid of claim 48.

50. A host cell expressing the recombinant construct of claim 33 or 49. 

51. The host cell of claim 50, wherein the host cell is a dendritic cell, macrophage, 
tumor cell, or tumor-derived cell. 

52. The host cell of claim 50, wherein the host cell is a bacterium, fungus, or
protozoan.

53. A pharmaceutical composition comprising the host cell of claim 50 and a
pharmaceutically acceptable adjuvant, carrier, diluent, or excipient.

54. A vaccine or immunotherapeutic composition comprising at least one component
selected from the group consisting of the epitope of claim 1; the composition of claim 18, 32, 45, or
53; the construct of claim 33; the T cell of claim 40; and the host cell of claim 50.

55. A method of treating an animal, comprising:

administering to an animal the vaccine or immunotherapeutic composition of claim 54.

56. The method of claim 55, wherein the administering step comprises a mode of
delivery selected from the group consisting of transdermal, intranodal, perinodal, oral, intravenous,
intradermal, intramuscular, intraperitoneal, mucosal, aerosol inhalation, and instillation.

57. The method of claim 55, further comprising a step of assaying to determine a 
characteristic indicative of a state of a target cell or target cells. 

58. The method of claim 57, comprising a first assaying step and a second assaying
step, wherein the first assaying step precedes the administering step, and wherein the second
assaying step follows the administering step.

59. The method of claim 58, further comprising a step of comparing the characteristic 
determined in the first assaying step with the characteristic determined in the second assaying step 
to obtain a result. 

60. The method of claim 59, wherein the result is selected from the group consisting
of: evidence of an immune response, a diminution in number of target cells, a loss of mass or size
of a tumor comprising target cells, and a decrease in number or concentration of an intracellular
parasite infecting target cells.

61. A method of evaluating immunogenicity of a vaccine or immunotherapeutic
composition, comprising:

administering to an animal the vaccine or immunotherapeutic composition of claim
54; and

evaluating immunogenicity based on a characteristic of the animal.

62. The method of claim 61, wherein the animal is HLA-transgenic.

63. A method of evaluating immunogenicity, comprising:

in vitro stimulation of a T cell with the vaccine or immunotherapeutic composition
of claim 54; and

evaluating immunogenicity based on a characteristic of the T cell.

64. The method of claim 63, wherein the stimulation is a primary stimulation.

65. A method of making a passive/adoptive immunotherapeutic, comprising:

combining the T cell of claim 40 or the host cell of claim 50 with a
pharmaceutically acceptable adjuvant, carrier, diluent, or excipient.

66. A method of determining specific T cell frequency comprising the step of
contacting T cells with an MHC-peptide complex comprising the epitope of claim 1.
67. The method of claim 66, wherein the contacting step comprises at least one
feature selected from the group consisting of immunization, restimulation, detection, and
enumeration.

68. The method of claim 66, further comprising ELISPOT analysis, limiting dilution
analysis, flow cytometry, in situ hybridization, the polymerase chain reaction, or any combination
thereof.

69. A method of evaluating immunologic response, comprising the method of claim 66 
carried out prior to and subsequent to an immunization step. 

70. A method of evaluating immunologic response, comprising:

determining frequency, cytokine production, or cytolytic activity of T cells, prior
to and subsequent to a step of stimulation with MHC-peptide complexes comprising the
epitope of claim 1.

71. A method of diagnosing a disease comprising:

contacting a subject tissue with at least one component selected from the group
consisting of the T cell of claim 40, the host cell of claim 50, the antibody of claim 36, and the
protein of claim 46; and

diagnosing the disease based on a characteristic of the tissue or of the component.

72. The method of claim 71, wherein the contacting step takes place in vivo. 

73. The method of claim 71, wherein the contacting step takes place in vitro. 

74. A method of making a vaccine, comprising:

combining at least one component selected from the group consisting of the
epitope of claim 1; the composition of claim 18, 32, 45, or 53; the construct of claim 33;
the T cell of claim 40; and the host cell of claim 50, with a pharmaceutically acceptable
adjuvant, carrier, diluent, or excipient.

75. A computer readable medium having recorded thereon the sequence of any one of 
SEQ ID NOS: X -Y, in a machine having a hardware or software that calculates the physical, 
biochemical, immunologic, or molecular genetic properties of a molecule embodying said 
sequence. 

76. A method of treating an animal comprising combining the method of claim 55
with at least one mode of treatment selected from the group of radiation therapy,
chemotherapy, biochemotherapy, and surgery.

77. An isolated polypeptide comprising an epitope cluster from a target-associated 
antigen having the sequence as disclosed in Tables 25-44, wherein the amino acid sequence 
consists of not more than about 80% of the amino acid sequence of the antigen. 

78. A vaccine or immunotherapeutic product comprising the polypeptide of claim 77.

79. An isolated polynucleotide encoding the polypeptide of claim 78. 

80. A vaccine or immunotherapeutic product comprising the polynucleotide of claim 79.

81. The polynucleotide of claim 79 or 80, wherein the polynucleotide is DNA.

82. The polynucleotide of claim 79 or 80, wherein the polynucleotide is RNA. 
FIG. 7C

SUBSTITUTE SHEET (RULE 26)

WO 2003/008537

PCT/US2002/010189




[Figure: four chart panels whose plotted data did not survive extraction; each panel carries the axis label "pmol Amino Acid".]















[Figure: plotted data not recoverable from extraction; the axis label appears to read "Mean Fluorescent Intensity".]



Autologous DC Present A1 Peptide to CD8 T Cell

[Figure: plotted data not recoverable from extraction. Legend: PSMA 463-471; No peptide. Axis label: HLA A1 CD8 T Cell Density (x1000/well).]

FIG. 11



Secretion of IFN-gamma Was Blocked by Anti-A1 Antibody

[Figure: plotted data not recoverable from extraction. Axis label: Human CD8 T Cell Density (x1000/well).]

FIG. 12
Graphs show lysis of unpulsed EL4 cells (open circles) and EL4
cells pulsed with gp33 peptide (solid triangles). Symbols represent
individual mice and one of three similar experiments is shown.

FIG. 15 

[Figure: plotted data not recoverable from extraction. Axis label: Days after tumor challenge.]



Mean tumor volumes ± 1SD are shown for mice immunized 
with pEFGPL33A DNA (solid circles) or control pEGFP-N3 
DNA (open triangles). Numbers in brackets indicate number 
of mice with tumors/total number of mice in group. One 
of two similar experiments is shown. 